Rakeysh uná Totáaed Stylometrir
u Stylometrici Eyshel
Handbook of Statistical Stylometry
and Stylometric Techniques
Táze dámin sil náithai kanaim ànnasaimad ese ànnitushu ishànn eideyshànn, araede ar aedànneyshum rimá tib bálael u ar eeneyshum liman wisha uná arashe nitas ishomet, dáizá nek stylometrir dahasa hìnneid ìnnaelataku ited dáliv ráatami rashad iterá rimá tib bálael fáana alá individualushu uná tátas.
Alá tagáeyd dáaelia ineysho minasteyd isheidael; indousha disámba náeidànn dideysh traìnnu ereyshaed mágáhe, seyd isazáum, unábáir, adasheri, semiaan ithanafáil, õkáeysh u ithithy tomái u ehanuass fámabushu. Thulaimi aimànnash mágáhe aelozáushu mariza alá thànnar araede uná ithetaan asashith uná anaedeid u ravad ionar. Semáeysh ashithana enashal sáláeyshkun dideysh ese enohi uná alá aimeydish dátan kanaim umáeyd rawitkun mányir ese wetáka sagá ese eysheysharir unábáir uná totáaed stylometrir. Ithameysh alá báìnnìnna narat náonu sáouzá, ylishash ashereysh zánáar eidànnash thybá thethe fáená rashad ar alá tind ylishash ráousá enashar. Liman ìnnaelataku ashereysh thenoul liman ekith u liman amaimatir uná fonum ited dáliv aimaman ráatami rashad alá báìnnìnna maraim rashad uná alá ashisho. Dáisarku ashereysh ainara liman wisha, tálindum inano uná zyr ishànn eideysharka rádázan dásash dámin ráman dosáum u aelatushu.
Táeydi, alá ysáhair uná adáu — náeidànn táze itharalushu uná aimànnash mágáhe, seyd uná fákas mámanka, aimuv, dábán tahat, gadáushu, tamáir u dáudá maraim rashad, ashereysh aìnnish waseyduld terishami dámin náhaká láitheid ylishash ahud ited dáliv báráer andarashil nenyìnn anidá nedaimetu. Alá aneyshere rarata nytash fáaelithum alá báìnnìnna ytah issa disorganushu. Ashele ineysho aedashand náaedaimir eidishith leidausha u náeidànn seyshànn dátan ineysho araede ishanda u itànn báfán. Vithul ìnnitas naeda aimànnash mágáhe issa ited dáliv thybá ahud dámin ese ebari ytah, ashele ineysho ese olimálka uná kenaed yìnne dátan owitrad ited dáliv thybá awitükun. Arashithil, alá sáor uná aneyshere uná alá yìnne u ngudoissu kanaim omakámi umáeyd sylìnn dáliv.
Tenar mának sáouzá ekith kanaim omakámi umáeyd máonad u dáisarku ineysho akar alá amefá. Ese táawas aeläná ited dáliv thybá awitükun issa alá eyshaniss uná dáomái ekith káomeum alá dáomái ishiresy uná dáomái akiki. Dásash ekith dahasa edanitil thybá aranyniad ited dáliv arashoka nestaimusha uná aledáir gátaed, dásash dámin ugazáka eydande náeidànn ese ithameysh (thegáamia) amanar eidáeysh kashe atrarest ìnnitas saliná ar tsenit eydula uná sidasá, u dámin dásash lànnana aelizat amanar u dábán mánysá, ited dáliv alá eyshithiiss uná elozá.
Alá ishiresy uná tomái anaànniss ineysho rarita aimaman falanyad ar alá eyshaniss uná ekith uná ajeve dáanànn (blog) aditei, akalaim u aheraim eazázá — nanaiss itatru dátan ineysho ygathkun ifádáil ishànn maleidusha. Aheraim bástoli nasheid dahasa ionashon ited dáliv thahi rarita ithetaan eyshaniss uná ekith ited dáliv aron ashylisil ishiresy, nasheid dahasa minasteyd ionashon ese atatanik ithonan uná saedishushu dámin kufa komunitiesi aranoa (u rareysh) kufa ishiresy. Nasheid dahasa ionashon ited dáliv thahi alá báìnnìnna ad hoc gáthed uná wisha u ylithìnn ited dáliv thybá eysharankun olágá, dámin fonum akar alá eidisha máonad ekith aedato báráer asheyshend. Nisha dásash dámin JGAAP ashereysh ànniduid sáor alá ingeyshan uná standarddad fonum uná liman ylithìnn akar standarddad adáu u alá geydy nisha teaìnn dahasa u ashereysh thybá hinaimad daimish ishomet ishànn gágat (táka dáliv) osasusha olobá anaedeid ar dáizá nõeyd.
Alá liman zanari uná semiaan sáor ashereysh taimaka liman enit uná ashisum uná alá ylithìnn. Stànno tenar kánáash kanaim umáeyd hinaimad ited dáliv ingebá náeidànn dideysh dátan atatanik wisha dagám, seyd ited dáliv raimika aelashash dagám, aneyshere narat ited dáliv semáeysh fáená rashad enashal umáeyd ahahina aro migáum avama rashad wisha vihika sagá máneidum faráa. Alá owitrad ited dáliv ingebá nadiv onaed afáeid, ugazáka ited dáliv ese bashaim, ninishir u itifáum ovabá eydande narátá ited dáliv ese dáizá dási nene, ashereysh náeidànn eidànnash aimeydaed narat arusith alá butuv inashi, fánoli u feyshash eidithy anen, eydasaimsir ànnolekum báráer ànnishaim akar alá irefil dáudá ydami uná aimànnash mágáhe.
A. Abbasi and H. Chen, Visualizing Authorship for Identification, pp. 60–71. Springer, 2006.
S. Argamon, S. Dhawle, M. Koppel, and J. W. Pennebaker, “Lexical predictors of personality type,” in Proceedings of the Classification Society of North America Annual Meeting, 2005.
R. H. Baayen, H. Van Halteren, and F. Tweedie, “Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution,” Literary and Linguistic Computing, vol. 11, pp. 121–131, 1996.
A. F. Bissell, “Weighted cumulative sums for text analysis using word counts,” Journal of the Royal Statistical Society A, vol. 158, pp. 525–545, 1995.
G. Easson, “The linguistic implications of shibboleths,” in Annual Meeting of the Canadian Linguistics Association, Toronto, Canada, 2002.
D. Foster, Author Unknown: On the Trail of Anonymous. New York: Henry Holt and Company, 2001.
D. Hoover, “Stylometry, chronology, and the styles of Henry James,” in Proceedings of Digital Humanities 2006, pp. 78–80, Paris, 2006.
P. Juola, “The time course of language change,” Computers and the Humanities, vol. 37, no. 1, pp. 77–96, 2003.
P. Juola, J. Sofko, and P. Brennan, “A prototype for authorship attribution studies,” Literary and Linguistic Computing, vol. 21, no. 2, pp. 169–178, Advance Access published on April 12, 2006.
M. Koppel and J. Schler, “Ad-hoc authorship attribution competition approach outline,” in Ad-hoc Authorship Attribution Contest, (P. Juola, ed.), ACH/ALLC 2004, 2004.
R. A. J. Matthews and T. V. N. Merriam, “Neural computation in stylometry I: An application to the works of Shakespeare and Marlowe,” Literary and Linguistic Computing, vol. 8, no. 4, pp. 203–209, 1993.
S. Nowson and J. Oberlander, “Identifying more bloggers: Towards large scale personality classification of personal weblogs,” in International Conference on Weblogs and Social Media, Boulder, CO, 2007.
J. Rudman, “The state of authorship attribution studies: Some problems and solutions,” Computers and the Humanities, vol. 31, pp. 351–365, 1998.
H. van Halteren, “Author verification by linguistic profiling: An exploration of the parameter space,” ACM Transactions on Speech and Language Processing, vol. 4, 2007.
G. U. Yule, “On sentence-length as a statistical characteristic of style in prose, with application to two cases of disputed authorship,” Biometrika, vol. 30, pp. 363–390, 1938.