Tag: corpora

Summary: Looking for Corpora…

Dear All, In this post I collect all the suggestions I got for the following request: “Looking for Corpora in….” http://www.forum.santini.se/2014/03/looking-for-corpora-to-explore-cross-linguality/ Big thanks to (hope I have not forgotten anybody): Johannes Heinecke, Dominika Rogozinska, Mohamed-Zakaria KURDI, Bartosz Ziólko, Olga Whelan,…

Looking for Corpora to explore Cross-Linguality

Dear All, I am looking for corpora of any genre in the following languages: English, Swedish, Polish, Italian, Finnish, Estonian, and Hungarian. I am already aware of a number of corpora (several posts in this blog are dedicated to the dissemination…

Summary: Multi-dimensional Social Network Datasets

Last Updated: 8 Oct 2012. Here is a summary of the suggestions received so far to the request for multi-dimensional social network datasets/corpora/collections (read the request here). Please do not hesitate to contact me for further suggestions. Suggestions Datasets: *…

Review: Creating Corpora With Active Learning

PhD thesis reviewed by Marina Santini. Fredrik Olsson, Bootstrapping Named Entity Annotation by Means of Active Machine Learning: A Method for Creating Corpora. Doctoral thesis, University of Gothenburg, 2008. Download thesis from this page: http://soda.swedish-ict.se/3518/ The PhD thesis “Bootstrapping Named…