Vít Suchomel
Masaryk University and Lexical Computing Ltd.
Verified email at mail.muni.cz
Title / Cited by / Year
The Sketch Engine: ten years on
A Kilgarriff, V Baisa, J Bušta, M Jakubíček, V Kovář, J Michelfeit, P Rychlý, ...
Lexicography 1 (1), 7-36, 2014
Cited by: 522; Year: 2014
The tenten corpus family
M Jakubíček, A Kilgarriff, V Kovář, P Rychlý, V Suchomel
7th International Corpus Linguistics Conference CL, 125-127, 2013
Cited by: 181; Year: 2013
Efficient web crawling for large text corpora
V Suchomel, J Pomikálek
Proceedings of the seventh Web as Corpus Workshop (WAC7), 39-43, 2012
Cited by: 77; Year: 2012
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation.
O Bojar, V Diatka, P Rychlý, P Straňák, V Suchomel, A Tamchyna, ...
LREC, 3550-3555, 2014
Cited by: 75; Year: 2014
SkELL: Web Interface for English Language Learning.
V Baisa, V Suchomel
RASLAN, 63-70, 2014
Cited by: 36; Year: 2014
Finding terms in corpora for many languages with the Sketch Engine
M Jakubíček, A Kilgarriff, V Kovář, P Rychlý, V Suchomel
Proceedings of the Demonstrations at the 14th Conference of the European …, 2014
Cited by: 34; Year: 2014
The sketch engine: Ten years on. Lexicography 1: 7–36
A Kilgarriff, V Baisa, J Bušta, M Jakubíček, V Kovář, J Michelfeit, P Rychlý, ...
Cited by: 31; Year: 2014
arTenTen: Arabic corpus and word sketches
T Arts, Y Belinkov, N Habash, A Kilgarriff, V Suchomel
Journal of King Saud University-Computer and Information Sciences 26 (4 …, 2014
Cited by: 28; Year: 2014
Text Tokenisation Using unitok.
J Michelfeit, J Pomikálek, V Suchomel
RASLAN, 71-75, 2014
Cited by: 19; Year: 2014
Recent Czech Web Corpora.
V Suchomel
RASLAN, 77-83, 2012
Cited by: 16; Year: 2012
Efficient web crawling for large text corpora
J Pomikálek, V Suchomel
Proceedings of the 7th Web-as-Corpus workshop, Lyon, France, 2012
Cited by: 14; Year: 2012
Large corpora for Turkic languages and unsupervised morphological analysis
V Baisa, V Suchomel
Proceedings of the Eighth conference on International Language Resources and …, 2012
Cited by: 9; Year: 2012
arTenTen: a new, vast corpus for Arabic
Y Belinkov, N Habash, A Kilgarriff, N Ordan, R Roth, V Suchomel
Proceedings of WACL 20, 2013
Cited by: 8; Year: 2013
Building a 50M Corpus of Tajik Language.
G Dovudov, J Pomikálek, V Suchomel, P Smerk
RASLAN, 89-95, 2011
Cited by: 7; Year: 2011
Terminology Extraction for Academic Slovene Using Sketch Engine
D Fišer, V Suchomel, M Jakubíček
RASLAN 2016 Recent Advances in Slavonic Natural Language Processing, 135, 2016
Cited by: 6; Year: 2016
Japanese Language Lexical and Grammatical Profiling Using the Web Corpus JpTenTen
I Srdanović, V Suchomel, T Ogiso, A Kilgarriff
Proceeding of the 3rd Japanese corpus linguistics workshop. Tokyo: NINJAL …, 2013
Cited by: 6; Year: 2013
Web spam
A Kilgarriff, V Suchomel
Proc. 8th Web as Corpus Workshop, 2013
Cited by: 6; Year: 2013
HindEnCorp 0.5
O Bojar, V Diatka, P Straňák, A Tamchyna, D Zeman
Charles University, Faculty of Mathematics and Physics, Institute of Formal …, 2014
Cited by: 5; Year: 2014
chared: Character Encoding Detection with a Known Language.
J Pomikálek, V Suchomel
RASLAN, 125-129, 2011
Cited by: 4; Year: 2011
DSL Shared task 2016: Perfect Is The Enemy of Good Language Discrimination Through Expectation–Maximization and Chunk-based Language Model
O Herman, V Suchomel, V Baisa, P Rychlý
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties …, 2016
Cited by: 3; Year: 2016