dbo:abstract
|
- Writeprint is a method in forensic linguistics for identifying the authors of online texts, likened to a digital fingerprint. Identity is established by comparing the distinguishing stylometric characteristics (writer invariants) of an unknown text with known writing samples from the suspected author. Even without a suspect, writeprint can suggest background characteristics of the author, such as nationality and education. Author identification in writeprint draws on five broad categories of features:
* Lexical features - the analysis of the author's choice of vocabulary, using character-level and word-level measures to identify an individual's preferences, such as the use of uppercase and lowercase letters, the frequency of certain letters, average word length, and mean utterance length;
* Syntactic features - the analysis of the author's writing style and sentence structure, such as punctuation and hyphenation, use of passive voice, and sentence complexity;
* Structural features - the analysis of the author's organization and structural arrangement of the work, including paragraph length, spacing, indentation, the arrangement of sentences within paragraphs, and, in an email setting for example, the use of greetings, farewells, and signatures;
* Content-specific features - the analysis of language that is contextually significant to the subject of the written work, including the use of slang or acronyms. More specifically, these features reveal the writer's interests by pinpointing the keywords the writer uses;
* Idiosyncratic features - the analysis of errors and other ungrammatical elements that may be unique to the author, such as incorrect spelling, misuse of words, and inaccurate verb forms. Because such habits are hard for an author to control, this category has achieved high accuracy in author identification when combined with other features.
While the five feature categories above are the traditional basis of author identification, online text also has features of its own. Choice of font, use of emojis, and links to other websites all provide a path to identification that is absent in traditional text analysis. (en)
- A writeprint is a text's length, content, idiomatic expressions, and design, which can be used to determine whether two texts were written by the same person. Just as fingerprints are traceable, writeprints are traceable too and can be used to track down anonymous persons. Figuratively speaking, a writeprint is a digital fingerprint. As more anonymization tools become available on the Internet, writeprints may play a larger role in the future. (sv)
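- A writeprint (筆紋) is a concept proposed by forensic linguists that refers to an author's distinctive writing style and characteristics, analogous to a fingerprint. These characteristics typically show up in vocabulary richness, sentence length, the use of function words, paragraph structure, and the use of keywords. Provided a work was written by a single author, the writeprint can be used to identify who wrote it. Like a fingerprint, a writeprint helps identify offenders in digital media. (zh)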
|
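As an illustration of the lexical category described in the English abstract, the following is a minimal Python sketch of how a few of the named measures (use of uppercase letters, letter frequency, average word length, mean utterance length) might be computed and compared. The function names `lexical_features` and `style_distance`, the ASCII-only word pattern, and the simple absolute-difference comparison are all assumptions made for this example; they are not taken from any published writeprint implementation, which would typically combine many more features from all five categories.

```python
import re
from collections import Counter


def lexical_features(text: str) -> dict:
    """Compute a handful of lexical measures for one text (illustrative only)."""
    # Assumption: words are runs of ASCII letters and apostrophes.
    words = re.findall(r"[A-Za-z']+", text)
    letters = [c for c in text if c.isalpha()]
    # Assumption: an "utterance" is approximated by a sentence ending in . ! or ?
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    letter_counts = Counter(c.lower() for c in letters)
    return {
        "uppercase_ratio": (
            sum(c.isupper() for c in letters) / len(letters) if letters else 0.0
        ),
        "avg_word_length": (
            sum(len(w) for w in words) / len(words) if words else 0.0
        ),
        "avg_sentence_length": len(words) / len(sentences) if sentences else 0.0,
        "letter_freq": {c: n / len(letters) for c, n in letter_counts.items()},
    }


def style_distance(a: dict, b: dict) -> float:
    """Crude, hypothetical dissimilarity: sum of absolute differences
    over the scalar features. Smaller means more similar style."""
    keys = ("uppercase_ratio", "avg_word_length", "avg_sentence_length")
    return sum(abs(a[k] - b[k]) for k in keys)


if __name__ == "__main__":
    known = lexical_features("I think we should meet. Let me know what works!")
    unknown = lexical_features("I think this could work. Let me know soon!")
    print(style_distance(known, unknown))
```

A distance like this only hints at similarity between an unknown text and a known sample; on short texts the measures are noisy, which is one reason the abstract stresses combining feature categories.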