Hello Blackboard Guest, we are pleased to welcome you to Text Technologies for Data Science (2021-2022)[YR].
This year, this course is being taught by two lecturers, Walid Magdy (left) and Björn Ross (right), and a number of teaching support staff.

During semester 1 there will be lectures that, due to the size of this course, will take place online. You are encouraged to join live; recordings will also be made available. There will also be online drop-in lab sessions with multiple time slots that you can join at a time that is convenient for you. There will be two coursework assignments in semester 1.
During semester 2 you will work in small groups on coursework 3, supported by us. We encourage you to meet up in person for the group project if you feel comfortable doing so, but this is your decision.
This Learn page will be used for the submission of coursework. On the
public page of the course, you will be able to find lecture slides, lab instructions and coursework descriptions. Discussions about course content will take place on
Piazza. Drop-in labs will take place on
Microsoft Teams.
Learning Outcomes
On successful completion of this course, you should be able to:
1. Build basic search engines from scratch, and use IR tools for searching massive collections of text documents
2. Build feature extraction modules for text classification
3. Implement evaluation scripts for IR and text classification
4. Understand how web search engines (such as Google) work
5. Work effectively in a team to produce working systems