The WebGenre Blog: The power of genre applied to digital information. By Marina Santini » Publications

Publications

Marina Santini’s Publications

Academic Books

Santini Marina (2011). Automatic Identification of Genre in Web Pages. A new perspective. LAP Lambert Academic Publishing. Paperback. Book Outline.

Mehler A., Sharoff S. and Santini M. (eds) (2010). Genres on the web: Computational Models and Empirical Studies. Springer Series: Text, Speech and Language Technology (Series Editors: Nancy Ide, Jean Véronis).

Book Reviews


Book Review (2011): Violeta Seretan, Syntax-Based Collocation Extraction. Springer, 2011, published on The WebGenreBlog.

Book Review (2011): Janet Giltrow and Dieter Stein (eds), Genres in the Internet. John Benjamins Publishing Company, Amsterdam-Philadelphia, published in Corpus Linguistics and Linguistic Theory (2009).

Book Review (2010): Bateman J. Multimodal Documents and Genre. Palgrave Macmillan. 2008. See also LINGUIST List 21.1606, Fri April 02 2010.

Book Review (2010): Heyd T. Email Hoaxes – Form, Function, Genre Ecology. John Benjamins. 2008. See also LINGUIST List 21.75, Thu Jan 07 2010.

Book Review (2009): Marianne Hundt, Nadja Nesselhauf and Carolin Biewer (eds). Corpus Linguistics and the Web. Rodopi, 2007. See also Corpora. Volume 4, Page 209-211.

Book Review: Discourse on the move by D. Biber, U. Connor and T. Upton, Computational Linguistics, March 2009, Vol. 35, No. 1, Pages 105-107.

Book Review: Bruce I. (2008). Academic Writing and Genre. A Systematic Analysis. LINGUIST List 19.3079, Fri Oct 10 2008.

Book Review (2005): Janoschka, Anja, Web Advertising, New forms of communication on the Internet. Pragmatics & Beyond New Series 131. John Benjamins. 2004. LINGUIST List 16.1652, Mon May 23 2005.

Book Review (2004): Görlach, Manfred, Text Types and the History of English, Trends in Linguistics. Studies and Monographs 139, Mouton de Gruyter 2004, LINGUIST List 15.3136, Mon Nov 08 2004.

Academic Blog Moderator

The WebGenre Blog (http://www.forum.santini.se/) is a meeting point for academia and industry.

Talks

Santini M. (2011). Computational Models for Automatic WebGenre Identification. Talk given at Stockholm University, Uppsala University, Borås University, Gothenburg University.

Santini M. (2010). Identificazione automatica dei generi testuali sul web: Stato dell’arte. Tavola Rotonda PAISÀ – CiC, Università di Bologna, 9 aprile 2010.

Santini M. (2009). “Making sense of different genre taxonomies”. Automated Document Genre Classification Workshop: Supporting Digital Curation, Information Retrieval, and Knowledge Extraction, 9 September 2009, Microsoft Research, Cambridge, UK.

Santini M. (2008). State of the Art in Automatic Genre Classification: Where do we go from here? Seminar, Department of Computer Science, Information Retrieval, University of Glasgow, Glasgow, UK, http://www.dcs.gla.ac.uk/research/groups/oneevent.cfm?eventid=2559.

Santini M. (2007). Supervised automatic web genre classification. Talk given at Stockholm University.

Editorial and Organizational Activities

2009

Coordinator and co-editor with Alexander Mehler and Serge Sharoff of “Genres on the Web: Computational Models and Empirical Studies”. Springer.

Coordinator and co-editor with Georg Rehm, Serge Sharoff and Alexander Mehler of the Special Issue on Genre in the Journal for Language Technology and Computational Linguistics (JLCL), Volume 24, Number 1, 2009.

2008

Co-editor with Georg Rehm, Alexander Mehler and Serge Sharoff of the WebGenreWiki, <http://purl.org/net/webgenres>.

2007

Co-organizer and co-chair with Serge Sharoff of the Colloquium “Towards a Reference Corpus of Web Genres” (Friday, 27 July 2007) held in conjunction with Corpus Linguistics 2007, Birmingham, UK (http://corpus.leeds.ac.uk/serge/webgenres/colloquium/).

Co-organizer and co-chair with Georg Rehm: Workshop “Towards Genre-Enabled Search Engines: The Impact of NLP” (Sunday, 30 Sept. 2007) held in conjunction with RANLP, Borovets, Bulgaria (http://www.sics.se/use/genre-ws/).

Most Recent Papers, Articles, Chapters and Talks

Santini M., Sharoff S. and Mehler A. (2010) “Riding the Rough Waves of the Web”, Introduction. In Mehler A., Sharoff S. and Santini M. (eds.), Genres on the web: Computational Models and Empirical Studies, Springer.

Santini M. (2011) “Cross-testing a Genre Classification Model for the Web”. In Mehler A., Sharoff S. and Santini M. (eds.), Genres on the web: Computational Models and Empirical Studies, Springer.

Santini M. (2010) Identificazione automatica dei generi testuali sul web: Stato dell’arte. Tavola Rotonda PAISÀ – CiC, Università di Bologna, 9 aprile 2010.

Santini M. and Sharoff S. (2009) “Web Genre Benchmark Under Construction”. Journal for Language Technology and Computational Linguistics (JLCL) 2009, volume 25, number 1, Special Issue: “Automatic Genre Identification: Issues, and Prospects”.

Santini M., Rehm G., Sharoff S. and Mehler A. (2009) Editorial of the Special Issue: “Automatic Genre Identification: Issues, and Prospects” (http://ldv-forum.org/2009_Heft1/Editorial.pdf). Journal for Language Technology and Computational Linguistics (JLCL) 2009, volume 25, number 1.

Santini M. (2009). “Making sense of different genre taxonomies”. Automated Document Genre Classification Workshop: Supporting Digital Curation, Information Retrieval, and Knowledge Extraction, 9 September 2009, Microsoft Research, Cambridge, UK.

Santini M. (2008). Cross-testing a Genre Classification Model. The second Swedish Language Technology Conference (SLTC-008). November 20 – 21, 2008, Stockholm. <http://www.speech.kth.se/sltc2008/abstracts/Cross-Testing_a_Genre_Classification_Model.pdf>

Santini M. (2008). State of the Art in Automatic Genre Classification: Where do we go from here? University of Glasgow, Glasgow, UK <http://www.dcs.gla.ac.uk/research/groups/oneevent.cfm?eventid=2559>.

Santini M. (2008). “WebGenre and NLP: Identification of genres on the web through the processing of natural language”. Processing Text-technological Resources Conference, Bielefeld University, Germany. <http://coli.lili.uni-bielefeld.de/Texttechnologie/Forschergruppe/PTTR/abstracts/Abstract-Santini.pdf>.

Santini M. and Rosso M. (2008). “Testing a Genre-Enabled Application: A Preliminary Assessment”, Proceedings of Future Directions in Information Access (FDIA-2008), BCS, London. <http://www.bcs.org/upload/pdf/ewic_fd08_paper7.pdf>

Rehm G., Santini M., Mehler A., Braslavski P., Gleim R., Stubbe A., Symonenko S., Tavosanis M. and Vidulin V. (2008). “Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems”, LREC 2008. Marrakech. <http://www.astro-susi.de/genre/lrec2008.pdf>

Sentiment Classification

Généreux M. and Santini M. (2007). Exploring the use of Linguistic Features in Sentiment Analysis. Corpus Linguistics 2007 – 27-30 July 2007, Birmingham. http://www.nltg.brighton.ac.uk/home/Michel.Genereux/CL2007.pdf

Généreux M. and Santini M. (2007). Défi: Classification de Textes Français Subjectifs. 3ème DÉfi Fouille de Textes – 3rd July 2007, Grenoble. http://www.nltg.brighton.ac.uk/home/Michel.Genereux/deft2007-equipe3.pdf

Workshop Proceedings

Santini M. (ed.) (2007). “Abstracts – Proceedings of the Colloquium Towards a Reference Corpus of Web Genres”, Corpus Linguistics, Birmingham, 2007, (http://corpus.leeds.ac.uk/serge/webgenres/colloquium/Proceedings.pdf).

Rehm G. and Santini M. (eds.) (2007). “Proceedings of the International Workshop Towards Genre-Enabled Search Engines: The Impact of NLP”, Borovets, Bulgaria (http://www.sics.se/use/genre-ws/RANLP-Genre-Workshop-Rehm-Santini_final.pdf).

Publications related to my PhD Research

Peer-Reviewed Journal Articles

Santini M. (2008). “Zero, Single, or Multi? Genres of Web Pages through the Users’ Perspective”. Information Processing & Management. Volume 44, Issue 2, March 2008, pp. 702–737.

Santini M. (2006). “Web pages, text types, and linguistic features: Some issues”. ICAME Journal, Vol. 30, pp. 67-86.

Peer-Reviewed Conference/Workshop Papers

Santini M. (2007). “Automatic Genre Identification: Towards a Flexible Classification Scheme”. BCS IRSG Symposium: Future Directions in Information Access 2007 (FDIA 2007), Tuesday, 28th and Wednesday, 29th of August, Glasgow, Scotland. Held in conjunction with the European Summer School on IR (ESSIR 2007).

Santini M. (2007). “Characterizing Genres of Web Pages: Genre Hybridism and Individualization”. Proceedings of the 40th Hawaii International Conference on System Sciences (HICSS-40). Hawaii (USA).

Santini M., Power R. and Evans R. (2006). “Implementing a Characterization of Genre for Automatic Genre Identification of Web Pages”. Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (ACL/COLING 2006). Main Conference Poster Paper. Sydney (Australia).

Santini M. (2006). “Common Criteria for Genre Classification: Annotation and Granularity”. Proceedings of the Workshop on Text-based Information Retrieval (TIR-06) (held in conjunction with ECAI 2006). Riva del Garda (Italy).

Santini M. (2006). “Some issues in Automatic Genre Classification of Web Pages”. Actes des 8èmes Journées Internationales d’Analyse Statistique des Données Textuelles (JADT 2006). Besançon (France).

Santini M. (2006). “Identifying Genres of Web Pages”. Actes de la 13ème Conference sur le Traitement Automatique des Langues Naturelles (TALN 2006). Leuven (Belgium).

Santini M. (2006). “Interpreting Genre Evolution on the Web”. Proceedings of the Workshop on NEW TEXT. Wikis and blogs and other dynamic text sources (held in conjunction with EACL 2006). Trento (Italy).

Santini M. (2005). “Automatic Text Analysis: Gradations of Text Types in Web Pages”. Proceedings of the 10th ESSLLI Student Session (17th European Summer School in Logic, Language and Information). Edinburgh, Scotland (UK).

Santini M. (2005). “Building on Syntactic Annotation: Labelling Subordinate Clauses”. Proceedings of the Workshop on Exploring Syntactically Annotated Corpora (held in conjunction with Corpus Linguistics 2005 Conference). Birmingham (UK).

Santini M. (2005). “Clustering Web Pages to Identify Emerging Textual Patterns”. TALN & RECITAL 2005 (Tome 1 – Conférences principales). “Posters RECITAL”, pp. 703-708. Dourdan (France).

Santini M. (2005). “Genres In Formation? An Exploratory Study of Web Pages using Cluster Analysis”. Proceedings of the 8th Annual Colloquium for the UK Special Interest Group for Computational Linguistics (CLUK 2005). Manchester (UK).

Santini M. (2004). “Identification of Genres on the Web: a Multi-Faceted Approach”. Proceedings of the 26th European Conference on Information Retrieval (ECIR 2004). Poster Paper. Sunderland (UK).

Santini M. (2004). “A Shallow Approach To Syntactic Feature Extraction For Genre Classification”. Proceedings of the 7th Annual Colloquium for the UK Special Interest Group for Computational Linguistics (CLUK 2004). Birmingham (UK).

Technical Reports

Santini M. (2005). “Linguistic Facets for Genre and Text Type Identification: A Description of Linguistically-Motivated Features”. Technical Report ITRI-05-02. NLTG, University of Brighton, Brighton (UK). <http://www.itri.brighton.ac.uk/techindex.html>

Santini M. (2004). “State-of-the-art on Automatic Genre Identification”. Technical Report ITRI-04-03, 2004. NLTG, University of Brighton, Brighton (UK). <http://www.itri.brighton.ac.uk/techindex.html>

Presentations and Posters

Santini M. (2006). “Deriving web genres from text types: A corpus-based approach”. Presentation at AAACL 2006 Conference, Flagstaff, AZ (USA). <http://www.nau.edu/english/AAACL/AAACL.htm>

Santini M. (2005). “Annotated corpora vs. raw web page collections. Text types, web pages, and Linguistic features: Some issues”. Presentation at AAACL-6/ICAME-26 Conference, Ann Arbor, Michigan (USA).

Santini M. (2004). “Identification of Genres on the Web” Research Student Poster Session at the Semantic Interoperability and Data Mining in Biomedicine Summer school and Workshop <http://mcs.open.ac.uk/semantic-mining/posters.html>, Hotel Füred, Balatonfüred, Hungary.
