About: Croatian National Corpus


An entity of type yago:Relation100031921, within data space dbpedia.org, associated with source document(s)
http://dbpedia.org/describe/?url=http%3A%2F%2Fdbpedia.org%2Fresource%2FCroatian_National_Corpus
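The describe link above carries the resource URI as a percent-encoded `url` query parameter. A minimal sketch of how such a link could be assembled with Python's standard library follows; the helper name `describe_url` and the constant `DESCRIBE_ENDPOINT` are hypothetical names chosen for illustration and are not part of any DBpedia or Virtuoso API.

```python
from urllib.parse import quote

# Endpoint taken from the link shown on this page.
DESCRIBE_ENDPOINT = "http://dbpedia.org/describe/"


def describe_url(resource_uri: str) -> str:
    """Build a describe link by percent-encoding the full resource URI.

    safe="" makes quote() encode ':' and '/' as well, which matches the
    encoding visible in the link on this page.
    """
    return f"{DESCRIBE_ENDPOINT}?url={quote(resource_uri, safe='')}"


if __name__ == "__main__":
    print(describe_url("http://dbpedia.org/resource/Croatian_National_Corpus"))
```

The same helper should work for any other resource URI in the same data space, assuming the endpoint accepts the `url` parameter in this form.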

Croatian National Corpus (Croatian: Hrvatski nacionalni korpus, HNK) is the largest and most important corpus of Croatian. Its compilation started in 1998 at the Institute of Linguistics of the Faculty of Humanities and Social Sciences, University of Zagreb, following the ideas of Marko Tadić. The theoretical foundations, and calls for a general-purpose, representative, multi-million-token corpus of Croatian, had begun to appear even earlier. The Croatian National Corpus is compiled from selected texts written in Croatian covering all fields, topics, genres and styles: from literary and scientific texts to textbooks, newspapers, user groups and chat rooms.

Attributes	Values
rdf:type	yago:WikicatCorpora yago:Abstraction100002137 yago:Assets113329641 yago:Capital113353607 yago:Possession100032613 yago:Principal113355868 yago:Relation100031921
rdfs:label	Croatian National Corpus (en)
rdfs:comment	Croatian National Corpus (Croatian: Hrvatski nacionalni korpus, HNK) is the largest and most important corpus of Croatian. Its compilation started in 1998 at the Institute of Linguistics of the Faculty of Humanities and Social Sciences, University of Zagreb, following the ideas of Marko Tadić. The theoretical foundations, and calls for a general-purpose, representative, multi-million-token corpus of Croatian, had begun to appear even earlier. The Croatian National Corpus is compiled from selected texts written in Croatian covering all fields, topics, genres and styles: from literary and scientific texts to textbooks, newspapers, user groups and chat rooms. (en)
dcterms:subject	Corpora Croatian language Online databases Applied linguistics Linguistic research
Wikipage page ID	4744726 (xsd:integer)
Wikipage revision ID	1031867331 (xsd:integer)
Link from a Wikipage to another Wikipage	Corpora Croatian language University of Zagreb Institute of Croatian Language and Linguistics Croatian language Masaryk University Sketch Engine Faculty of Humanities and Social Sciences, University of Zagreb Online databases Applied linguistics Linguistic research Language corpus
Link from a Wikipage to an external page	http://filip.ffzg.hr/cgi-bin/run.cgi/first_form https://web.archive.org/web/20060306212835/http:/www.hnk.ffzg.hr/mt/ https://web.archive.org/web/20060424031437/http:/hnk.ffzg.hr/ https://web.archive.org/web/20120303182544/http:/riznica.ihjj.hr/
sameAs	Croatian National Corpus Croatian National Corpus Croatian National Corpus Croatian National Corpus Croatian National Corpus Croatian National Corpus
dbp:wikiPageUsesTemplate	dbt:In_lang dbt:Reflist dbt:Croatian_language dbt:Corpus_linguistics
has abstract	Croatian National Corpus (Croatian: Hrvatski nacionalni korpus, HNK) is the largest and most important corpus of Croatian. Its compilation started in 1998 at the Institute of Linguistics of the Faculty of Humanities and Social Sciences, University of Zagreb, following the ideas of Marko Tadić. The theoretical foundations, and calls for a general-purpose, representative, multi-million-token corpus of Croatian, had begun to appear even earlier. The Croatian National Corpus is compiled from selected texts written in Croatian covering all fields, topics, genres and styles: from literary and scientific texts to textbooks, newspapers, user groups and chat rooms. The initial composition was divided into two constituents: (1) the 30-million corpus of contemporary Croatian (30m), which included samples of texts from 1990 onward. The criteria for including text samples were that they be written by native speakers and cover different fields, genres and topics; translated texts and poetry were excluded. (2) the Croatian Electronic Text Archive (HETA), which included complete texts, particularly serial publications (volumes, series, editions etc.) that would have unbalanced the 30m had they been inserted there. Since 2004, with the adoption of the concept of the third-generation corpus, the two-constituent structure has been abandoned in favor of several subcorpora and a larger size. Since 2005 HNK has comprised 105 million tokens and is composed of a number of different subcorpora, which can be searched individually or together as a whole corpus. In 2004 HNK also migrated to a new server platform, the Manatee/Bonito server-client architecture. Searching the HNK (today still with free test access) requires the free client program Bonito. The author of this corpus manager is Pavel Rychlý from the Natural Language Processing Laboratory of the Faculty of Informatics, Masaryk University in Brno, Czech Republic.
Its interface supports complex, elaborate corpus queries, different types of statistical results, total or partial word lists according to different query criteria (with their frequencies), frequency distributions of types, automatic collocation detection etc. The latest version of this corpus (version 3) has 216.8 million tokens. Online search is available through the Bonito 2 web interface, which is part of NoSketch Engine, a limited version of the Sketch Engine software. (en)
prov:wasDerivedFrom	wikipedia-en:Croatian_National_Corpus?oldid=1031867331&ns=0
page length (characters) of wiki page	4582 (xsd:nonNegativeInteger)
foaf:isPrimaryTopicOf	wikipedia-en:Croatian_National_Corpus
is Link from a Wikipage to another Wikipage of	List of online databases Croatian Encyclopedic Dictionary HNK List of text corpora
is foaf:primaryTopic of	wikipedia-en:Croatian_National_Corpus
