In previous posts on cognate identification, I discussed the difference between strict and loose cognates. Loose cognates are words in two languages that have the same or similar written forms. I also described how approaches to cognate identification tend to differ depending on whether the data is plain text or phonetic transcription, since the type of data informs the methods. With plain text, it is difficult to extract phonological information about the language, so past approaches have largely relied on string matching. I discuss some of those approaches below the jump. In my next post, when I get around to it, I will begin looking at some of the phonetic methods that have been applied to the task. (more…)
Posts Tagged ‘string matching’
Cognate Identification: Orthographic Methods
Posted: 26 January 2008 in Uncategorized
Tags: algorithms, cognate identification, cognates, computational linguistics, historical linguistics, language change, linguistics, machine translation, natural language processing, orthography, string matching