
Dr Alistair Willis
Senior Lecturer In Computing
School of Computing & Communications
Biography
Professional biography
Alistair Willis is a Senior Lecturer in Computing at the Open University. He holds a BA in Physics and Philosophy from the University of Cambridge, an MSc in Artificial Intelligence from the University of Edinburgh, and a DPhil in Computer Science from the University of York. After his DPhil, Alistair worked in the software engineering group at Philips Research Laboratories on automatic software testing, before joining the Open University's Computing Department in 2003.
Alistair leads the Artificial Intelligence and Natural Language Processing research group in the School of Computing and Communications, and leads the Data Science theme of the Institute of Coding.
Research interests
Alistair's research focuses on Natural Language Processing (NLP), and in particular how people interpret ambiguous text. At the Open University, he has investigated how to build computational models that predict when different people will disagree on the interpretation of ambiguous text. His work has looked at both the theoretical foundations of the phenomenon, and its potential impact in the area of requirements engineering.
He is also interested in the problem of how to recognise semantic similarity between texts. His research in this area applies inductive logic programming to learn systems for automatically grading students' written work. This work is currently being extended to use deep learning models for grading, as well as traditional symbolic methods.
The methods underlying these tasks are generalisable beyond NLP. Alistair has collaborated with computational musicologists on machine learning techniques to support automatic music composition for games, and with social scientists on using social media to understand the audience for the Russian television network RT.
Teaching interests
Alistair currently chairs the final year module Data Management and Analysis. This gives students a holistic view of the data lifecycle, developing a range of technical and discursive skills to enable them to use data to tell a story. By using open datasets, the module also attempts to demonstrate to students the possibility of using open data to effect change in the world.
Publications
Book Chapter
Detecting dangerous coordination ambiguities using word distribution (2007)
The availability of partial scopings in an underspecified semantic representation (2002)
Dataset
Digital Artefact
Journal Article
Seeing the smart city on Twitter: Colour and the affective territories of becoming smart (2019)
Developing and evaluating computational models of musical style (2016)
Mapping networks of influence: tracking Twitter conversations through time and space (2015)
The FuturICT education accelerator (2012)
A hybrid model for automatic emotion recognition in suicide notes (2012)
Towards the bibliography of life (2011)
Analysing anaphoric ambiguity in natural language requirements (2011)
Presentation / Conference
Bringing Timbral Shapes to Interactive Music Systems (2023)
Identifying Annotator Bias: A new IRT-based method for bias identification (2020)
Automatically calculating tonal tension (2020)
Towards a Cross-article Narrative Comparison of News (2020)
Developing Students' Written Communication Skills with Jupyter Notebooks (2020)
Agreement is overrated: A plea for correlation to assess human evaluation reliability (2019)
Evaluation methodologies in Automatic Question Generation 2013-2018 (2018)
Rethinking the Agreement in Human Evaluation Tasks (2018)
Search Personalization with Embeddings (2017)
Personalised Query Suggestion for Intranet Search with Temporal User Profiling (2017)
Adverse Drug Reaction Classification With Deep Neural Networks (2016)
Using NLP to support scalable assessment of short free text responses (2015)
Modelling time-aware search tasks for search personalisation (2015)
Temporal latent topic user profiles for search personalisation (2015)
Improving search personalisation with dynamic group formation (2014)
Methodological approaches to the evaluation of game music systems (2014)
Algorithmic music as intelligent game music (2014)
ComTax: community-driven curation for taxonomic databases (2013)
Literature-driven Curation for Taxonomic Name Databases (2013)
Curation tools for taxonomic databases (2013)
A generalised hybrid architecture for NLP (2012)
Speculative requirements: automatic detection of uncertainty in natural language requirements (2012)
Extending Nocuous Ambiguity Analysis for Anaphora in Natural Language Requirements (2010)
Automatic detection of nocuous coordination ambiguities in natural language requirements (2010)
A methodology for automatic identification of nocuous ambiguity (2010)
Using discovered, polyphonic patterns to filter computer-generated music (2010)
Improving search in scanned documents: Looking for OCR mismatches (2009)
Making tacit requirements explicit (2009)
On presuppositions in requirements (2009)
NP coordination in underspecified scope representations (2007)
Identifying nocuous ambiguities in natural language requirements (2006)
Disambiguating coordinations using word distribution information (2005)
Report
Literature Review on Patient-Friendly Documentation Systems (2006)
Can Online Learning Materials Improve Student Access to Digital Libraries? (2005)
Nocuous Ambiguities in Requirements Specifications (2005)
Using a Distributional Thesaurus to Resolve Coordination Ambiguities (2005)
Working Paper
Best Practices in using Technological Infrastructures (2020)