10 articles in this issue
Darja Fišer
Slovenšcina 2.0: "Computer-Mediated Communication"
Špela Arhar Holdt, Kaja Dobrovoljc
The main objective of this article is to assess the value of the Janes corpus for research in the field of language standardization. Unlike the existing reference corpora of written Slovenian, the newly available Janes corpus of user-generated content mos... see more
Tina Lengar Verovnik, Eva Vrtacic, Tanja Oblak Crnic
The article discusses several aspects of speech used by established political elites in the context of a developing digital tool Parlameter. This is done without understanding the internet and digital spaces as always already stylistically marked. Instead... see more
Darja Fišer, Tomaž Erjavec, Nikola Ljubešic
The paper presents the current version of the Slovene corpus of netspeak Janes which contains tweets, forum posts, news comments, blogs and blog comments, and user and talk pages from Wikipedia. First, we describe the harvesting procedure for each data so... see more
Marko Stabej, Helena Dobrovoljc, Simon Krek, Polona Gantar, Damjan Popic, Špela Arhar Holdt, Darja Fišer, Marko Robnik Šikonja
The transcript gives an account of the roundtable discussion that took place under the auspices of the conference Slovene On-Line and in New Media, on 27 November, 2015. Five distinguished scholars from the field of Slovene studies had been invited to par... see more
Špela Arhar Holdt
During 4 and 8 July 2016, the JANES Summer School and Summer Camp took place at the Faculty of Arts, University of Ljubljana. It was organized under the auspices of the Department of Translation and with financial suppport of the Slovenian research infras... see more
Urška Vranjek Ošlak, Mija Michelizza
The research of usernames in comments of online articles and on Twitter showed that users who comment online news tend to cover their identity more often than Twitter users. We assume that the reason for this is that online news commentators mostly expres... see more
Maja Milicevic, Nikola Ljubešic
In this paper we discuss the parallel manual normalisation of samples extracted from Croatian and Serbian Twitter corpora. We describe the datasets, outline the unified guidelines provided to annotators, and present a series of analyses of standard-to-non... see more
Tomaž Erjavec, Jaka Cibej, Darja Fišer
Web texts are becoming increasingly relevant sources of information, with web corpora useful for corpus linguistic studies and development of language technologies. Even though web texts are directly accessable, which substantially simplifies the collecti... see more
Dafne Marko, Iza Škrjanec, Jaka Cibej
28th European Summer School in Logic, Language and Information ESSLLI2016