{"id":598,"date":"2018-02-27T15:46:37","date_gmt":"2018-02-27T14:46:37","guid":{"rendered":"https:\/\/dhlunch.ijp.pan.pl\/?page_id=598"},"modified":"2018-02-27T15:46:37","modified_gmt":"2018-02-27T14:46:37","slug":"09-03-2018","status":"publish","type":"page","link":"https:\/\/dhlunch.ijppan.pl\/en\/09-03-2018\/","title":{"rendered":"09.03.2018"},"content":{"rendered":"<p>Paragraphs and excerpts, or: 1830-1918 micro-corpus<\/p>\n<p>Magdalena Derwojedowa<\/p>\n<p>In my talk, I will present a one-million corpus of Polish from 1830-1918, consisting of small samples of texts from the period. It was built for the purposes of the project &#8220;Automatic morphological analysis of Polish texts from 1830-1918 period with respect to evolution of inflection and spelling&#8221; (DEC-2012\/07\/B\/HS2\/00570).<\/p>\n<p>I will start by presenting the microstructure of the corpus: sampling, metadata and source files, and then briefly discuss the problems we encountered while working on the samples. In the second part, I will present the macrostructure of the corpus, its division into subcorpora and the variation achieved across the samples. Finally, I will present selected studies of linguistic phenomena that can be carried out with the corpus.<\/p>\n<p>The corpus, together with the Poliqarp search engine, is available online: <em>Search in dictionaries<\/em> (https:\/\/szukajwslownikach.uw.edu.pl\/f19\/).<\/p>","protected":false},"excerpt":{"rendered":"<p>Paragraphs and excerpts, or: 1830-1918 micro-corpus Magdalena Derwojedowa In my talk, I will present a one-million corpus of Polish from 1830-1918, consisting of small samples of texts from the period. 
&hellip; <a href=\"https:\/\/dhlunch.ijppan.pl\/en\/09-03-2018\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">09.03.2018<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-598","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/pages\/598","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/comments?post=598"}],"version-history":[{"count":0,"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/pages\/598\/revisions"}],"wp:attachment":[{"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/media?parent=598"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}