— Last Updated: 27 Feb 2017 —
Log: Debriefing available (Jan 2016)
Marina Santini’s contact details: marinasantini dot ms at g-m-a-i-l
ML4LT is an online self-paced introductory course in Machine Learning for Language Technology. It has been designed for linguists and for undergraduate students in Computational Linguistics. The course includes 10 lectures, both theoretical and practical. The practical part relies on the Weka Machine Learning Workbench (free software). [See Lab1 for installation].
The content of this page is based on selected material from the course: “ML4LT: Machine Learning for Language Technology 2016, Undergraduate Students”, Uppsala University.
I will update this page regularly with links, videos, labs, assignments and literature. When visiting this page keep an eye on the “last updated” date. The course and the linked material will be updated and upgraded continuously.
Pre-Requirements: elements of statistics and probability theory
Disclaimer: All the video clips are also available on YouTube.
Lecture 5: k-Nearest Neighbours. Lab4. Reading: Daume’ III (2015: 26-32, excl. 2.4); Witten et al. (2011:131-138).
- A few words about ML4LT Assignments (Pdf)
- Assignment 1: Decision Trees and k-Nearest Neighbours
- Assignment 2: Naive Bayes
- Assignment 3: k-Means and Hierarchical Clustering
- Debriefing: Reflections on the experiments included in the assignments
– Hal Daumé III (2015). A Course in Machine Learning. Copyright © 2015.Only chapters specified in the timetable.
– Ian H. Witten, Eibe Frank, Mark A. Hall (2011). Data Mining: Practical Machine Learning Tools and Techniques. 3rd Edition. Morgan Kaufmann Publishers. Only chapters specified in the timetable. You can also use the 2nd edition (freely available online). (Fourth Edition is available in Europe in January 2017: http://www.cs.waikato.ac.nz/ml/weka/book.html).
– Petro Domingos (2012). A Few Useful Things to Know about Machine Learning. Communications of the ACM, 55(10), 78-87.
– Evaluation of Clustering in C. D. Manning, P. Raghavan & H. Schütze (2008). Introduction to Information Retrieval. Cambridge University Press, © 2008 Cambridge University Press. Website: http://informationretrieval.org/