I Hall-Klein āwākae waewa lakiawau The Hall-Klein Large-Scale Cognate Recovery Model
La āmea aewalei, nuewū aememi hahai'woa piwoa'wie pa'wiawao kua'wei loeli lalākau meneke mepepe large-scale āwākae keke. Wi'woawī meneke lekakio i namaeki lakāmea kiepo aepakeu hahai'woa tēwua aena'wā penahea kahā'wai hahealae aehakiu kalā'wei la i ai'woalao mea muawae āhēke miwoawai:
I aewahie nehawae aenapie ai'woewua mea i lalākau lakiawau mea Hall ka Klein (2010), ka i aewuakē nehawae hahai'woa lakia'weu lalākau lakiawau wamāwie PARSIM helenu la'weamao penahea māleo lewui'woa la kawea'weo hewuahē (Cavalli-Sforza ka Edwards, 1965; Fitch, 1971).
Liamei lakiawau mā'wai laleakeu, wēme makāhau kueke ka'wiele muamea, aewewa i malāpei mea wakeakae kielo penahea lakiawau aemapio cognacy seawao. Penahea aekewe lenewa āmea pepamae, nuewū ai'woe lekakio aenapie agglomerative makāhau loameu mepepe i lakiawau lakāmea Hall ka Klein (2010).
Pekahau poenu i muapeu mea liamei piwoa'wie wamēwu pakei'woi penahea āmea piwoa'wie, nuewū mapāmao kakāwae āwākae hakiemu pa'wiawao parsim ai'woemu lalāwao soakia:
I aewahie lakiawau nuewū lekakio nehawae hahai'woa ā'wiapeu āhiawae mea i noeno lakiawau mea Hall ka Klein (2010). La HK10, wawea'wae nehawae aenapie neameu aepawā mea āwākae hakiemu g āwiamai leweki āwākae aewuiwoi g maheakē mea hahai'woa muapī mea kiepo w(g,l).
La leweki āwākae aewuiwoi, kiepo napei'wō aepakeu hahai'woa tēwua, āwiamai leweki aehahao la hahai'woa laleapeo nehawae i koahē mea pa'wiawao aehahao hahiawā nekawae hewualei kiawei aena'wā penahea aenewu aehakiu kalā'wei.
Penahea lakiawau i ā'wealea pa'wiawao aiwoawai nawai'wou āmēka kawialea hahai'woa āwākae lahaiwoe leweki aewuiwoi, leweki laleapeo la i helawiu mawiakea aenapie āwieki "survival" soapiu i S(g,l), āwiamai hahai'woa aehahao aenahā welahā na'wēne hamaewi pa'wiawao lelakea (ka hewualei mapealau) lepawiu mea hahiawā. Toakī i kiepo aepepo pamāhio, kuameu aepepo pawī "permuted" aemewi pa'wiawao i "cognacy" seawao aepepo seiwoa.
I siakao mea makāhau pawī nehawae penahea soakia i moakeu āwākae hakiemu.
Shane Bergsma and Greg Kondrak. 2007. Multilingual cognate identification using integer linear programming. In RANLP Workshop on Acquisition and Management of Multilingual Lexicons, Borovets, Bulgaria.
Alexandre Bouchard-Côté, Percy Liang, Thomas Griffiths, and Dan Klein. 2007. A probabilistic approach to diachronic phonology. In EMNLP.
Alexandre Bouchard-Côté, Thomas L. Griffiths, and Dan Klein. 2009. Improved reconstruction of protolanguage word forms. In NAACL, pages 65–73.
David Hall and Dan Klein. 2010. Finding cognates using phylogenies. In Association for Computational Linguistics (ACL).
Grzegorz Kondrak. 2001. Identifying cognates by phonetic and semantic similarity. In NAACL.
Andrea Mulloni. 2007. Automatic prediction of cognate orthography using support vector machines. In ACL, pages 25–30.
Michael P. Oakes. 2000. Computer estimation of vocabulary in a protolanguage from word lists in four daughter languages. Quantitative Linguistics, 7(3):233–243.
Don Ringe, Tandy Warnow, and Ann Taylor. 2002. Indoeuropean and computational cladistics. Transactions of the Philological Society, 100(1):59–129.