Some information about my past research career.
- Generating Maths Word Problems and Solutions: investigating automatic generation of (1) mathematical word problems and (2) hints for their solutions in easy-to-follow steps similar to those produced by basic-skills tutors. See Math Stories.
- Rendering the semantic web accessible to people who want to create or modify semantic content but do not know ontology languages and logics. In the SWAT project I provided natural language support for ontology verbalisation and authoring, developing grammars and discourse-structuring algorithms for the SWAT verbaliser.
- Computer-generation of numerical quantity expressions. Numerical quantities are often presented poorly (especially in the media). This frustrates numerate people, who are not given enough information, and perplexes innumerate people, who do not understand basic mathematical concepts. I believe that it is possible to vary descriptions of numerical quantities to suit different audiences, situations and writing styles. I led an ESRC Small Project, NumGen, to investigate this area.
My other research interests are discourse structure and microplanning, generation for people with limited literacy, discourse analysis, speech act analysis, intonation analysis, generation of prosodically-annotated text, dialogue management in spoken language systems and automatic text summarisation.
Past Research Projects
- Williams, Sandra, Richard Power and Allan Third (2014) How Easy is it to Learn a Controlled Natural Language for Building a Knowledge Base? Fourth Workshop on Controlled Natural Language (CNL 2014), Galway, Ireland. To be published in Springer LNCS Series. preprint (192KB).
- Williams, Sandra and Richard Power (2013) Hedging and Rounding in Numerical Expressions. Journal of Pragmatics and Cognition, Volume 21, No. 1, pp. 193-223, John Benjamins Publishing Co. Preprint (362KB).
- Susana Bautista, Raquel Hervás, Pablo Gervás, Richard Power and Sandra Williams (2013) A System for the Simplification of Numerical Expressions at Different Levels of Understandability. Proceedings of the workshop on Natural Language Processing for Improving Textual Accessibility (NLP4ITA), Atlanta, USA, pp. 10-19. pdf
- Nguyen, Tu Anh T.; Power, Richard; Piwek, Paul and Williams, Sandra (2013). Predicting the understandability of OWL inferences. In: Extended Semantic Web Conference 2013 (ESWC 2013) – Research Track, 26th May to 30th May, 2013, Montpellier, France. pdf
- Sandra Williams (2013) An Information Extraction System for English Ontology Identifier Names. Technical Report TR2013/03, Department of Computing, The Open University. pdf (360KB)
- Sandra Williams (2013) An Analysis of POS Tag Patterns in Ontology Identifiers and Labels. Technical Report TR2013/02, Department of Computing, The Open University. pdf (385KB)
- Nguyen, Tu Anh; Power, Richard; Piwek, Paul and Williams, Sandra (2012). Measuring the understandability of deduction rules for OWL. In: First International Workshop on Debugging Ontologies and Ontology Mappings (WoDOOM12), 8th October 2012, Galway, Ireland. pdf (313KB)
- Nguyen, Tu Anh T.; Power, Richard; Piwek, Paul and Williams, Sandra (2012). Planning accessible explanations for entailments in OWL ontologies. In: 7th International Natural Language Generation Conference (INLG 2012), 30 May-1 June 2012, Utica, IL, USA. pdf (273KB)
- Richard Power and Sandra Williams (2012) Generating numerical approximations. Computational Linguistics Volume 38, No. 1, pdf (749KB)
- Robert Stevens, James Malone, Sandra Williams, Richard Power, Allan Third (2011) Automating generation of textual class definitions from OWL to English. Journal of Biomedical Semantics, 2011 May 17, Vol. 2 Suppl 2:S5. pdf (1,278KB)
- Sandra Williams, Allan Third and Richard Power (2011) Levels of organisation in ontology verbalisation. Proceedings of the 13th European Workshop on Natural Language Generation, September 2011, Nancy, France, pp. 158-163. Paper: pdf (113KB). Poster: pdf (1.7MB).
- Sandra Williams (2011) Generating Mathematical Word Problems. Proceedings of the AAAI Fall Symposium on Question Generation, November 2011, Arlington, VA, USA, pp. 61-64. Paper: pdf (446KB). Slides: pdf (387KB).
- Susana Bautista, Raquel Hervás, Pablo Gervás, Richard Power, Sandra Williams (2011) Experimental Identification of the Use of Hedges in the Simplification of Numerical Expressions. The Second Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 30 July 2011, Edinburgh, Scotland. pdf (170KB)
- Susana Bautista, Raquel Hervás, Pablo Gervás, Richard Power, Sandra Williams (2011) How to Make Numerical Information Accessible: Experimental Identification of Simplification Strategies. In Part 1 Proceedings of 13th IFIP TC 13 International Conference on Human-Computer Interaction (INTERACT 2011), P. Campos, N. Graham, J. Jorge, N. Nunes, P. Palanque and M. Winckler (Eds.), Lecture Notes in Computer Science (LNCS) Volume 6946, pp. 57-64, September 5-9, DOI: 10.1007/978-3-642-23774-4. pdf (163KB)
- Tu Anh T. Nguyen, Richard Power, Paul Piwek and Sandra Williams (2011) Justification Patterns for OWL DL Ontologies. Technical Report TR2011-05, Department of Computing, The Open University. ISSN 1744-1986. pdf (288KB)
- Allan Third, Sandra Williams, Richard Power (2011) OWL to English: a tool for generating organised easily-navigated hypertexts from ontologies. In: 10th International Semantic Web Conference (ISWC 2011), 23 – 27 Oct 2011, Bonn, Germany. Paper: pdf (208KB). Poster: pdf (2271KB)
- Ehud Reiter and Sandra Williams  Generating Texts in Different Styles. In S Argamon, K Burns, S Dubnov (eds.) The Structure of Style: Algorithmic Approaches to Manner and Meaning. Springer-Verlag. Preprint, pdf (219KB)
- Sandra Williams and Richard Power  Grouping axioms for more coherent ontology descriptions. Proceedings of the 6th International Natural Language Generation Conference (INLG 2010), pp. 197-202 Dublin. pdf (116KB)
- Robert Stevens, James Malone, Sandra Williams and Richard Power  Automating Class Definitions From OWL to English. Bio-Ontologies 2010: Semantic Applications in Life Sciences SIG at 18th Annual International conference on Intelligent Systems for Molecular Biology (ISMB 2010), Boston. pdf (187KB)
- Sandra Williams and Richard Power  A fact-aligned corpus of numerical expressions. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC 2010), Malta, pp. 2396-2400. pdf (347KB)
- Sandra Williams and Richard Power [in preparation] Hedging and rounding in numerical expressions.
- Sandra Williams and Richard Power  Precision and mathematical form in first and subsequent mentions of numerical facts and their relation to document structure. Proceedings of 12th European Workshop on Natural Language Generation, Athens, March 30th – 31st. pdf (39KB)
- Sandra Williams and Ehud Reiter. Generating basic skills reports for low-skilled readers. Journal of Natural Language Engineering, Vol. 14, Issue 4, pp. 495-525, pdf (836KB).
- Sandra Williams and Ehud Reiter  SkillSum: basic skills screening with personalised, computer-generated feedback. Interactive Computer Aided Learning (ICL 2008), Special Track on Computer-based Knowledge & Skill Assessment and Feedback in Learning Settings (CAF 2008), September 2008, pages 1-8. pdf (366KB)
- Sandra Williams and Richard Power  Deriving rhetorical complexity data from the RST-DT Corpus. Proceedings of the 6th Language Resources and Evaluation Conference (LREC 2008), Marrakech, Morocco, 28-30 May, 2008. pdf (520KB). Additional data in fig. 3 of our poster pdf (101KB)
- Sandra Williams, Richard Power and Paul Piwek  Simulating emotional reactions in medical dramas. Proceedings of the Symposium on Affective Language in Human and Machine, Volume 2, The Society for the Study of Artificial Intelligence and the Simulation of Behaviour (AISB 2008) Convention: Communication, Interaction and Social Intelligence, Aberdeen, April 2008, pp. 25-32. pdf (215KB)
- Ehud Reiter and Sandra Williams  Three Approaches to Generating Texts in Different Styles. Proceedings of the Symposium on Style in text: creative generation and identification of authorship, Volume 7, The Society for the Study of Artificial Intelligence and the Simulation of Behaviour (AISB 2008) Convention: Communication, Interaction and Social Intelligence, Aberdeen, April 2008, pp. 26-33. pdf (190KB)
- Sandra Williams, Paul Piwek and Richard Power  Generating monologue and dialogue to present personalised medical information to patients. Proceedings of the 11th European Workshop on Natural Language Generation (ENLG’07), pp. 167-170. pdf (401KB)
- Paul Piwek, Richard Power and Sandra Williams. Generating scripts for personalised medical dialogues for patients. Technical Report 2006/06. Computing Department, The Open University. ISSN 1744-1986. pdf (87KB)
- Åhlfeldt, H., Borin, L., Daumke, P., Grabar, N., Hallett, C., Hardcastle, D., Kokkinakis, D., Mancini, C., Markó, K., Merkel, M., Pietsch, C., Power, R., Scott, D., Silvervarg, A., Toporowska Gronostaj, M., Williams, S., Willis, A. Literature review on patient-friendly documentation systems. Technical Report no. 2006/04. Department of Computing, Faculty of Mathematics and Computing, The Open University. ISSN 1744-1986. pdf (706KB)
- Ehud Reiter, Sandra Williams and Lesley Crichton  Generating Feedback Reports for Adults Taking Basic Skills Tests. Proceedings of the Twenty-fifth SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, Cambridge, UK. In A Macintosh, R Ellis, and T Allen (eds) Applications and Innovations in Intelligent Systems XIII (Proceedings of ES-05), pages 50-63. pdf (112KB)
- Sandra Williams and Ehud Reiter  Generating readable texts for readers with low basic skills. Proceedings of the 10th European Workshop on Natural Language Generation, Aberdeen, pages 140-147. pdf (118KB)
- Sandra Williams and Ehud Reiter  Appropriate Microplanning Choices for Low-Skilled Readers. Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, pages 1704-1708. pdf (93KB)
- Sandra Williams and Ehud Reiter  Deriving content selection rules from a corpus of non-naturally occurring documents for a novel NLG application. Proceedings of the Workshop on Using Corpora for Natural Language Generation, Information Technology Research Institute (ITRI) Technical Report, ITRI-05-03, University of Brighton, pages 41-48. pdf (157KB)
- Sandra Williams and Ehud Reiter  Reading errors made by skilled and unskilled readers: evaluating a system that generates reports for people with poor literacy. Poster at the Fourteenth Annual Meeting of the Society for Text and Discourse, Chicago. poster pdf (327KB), paper pdf (42KB)
- Sandra Williams and Ehud Reiter  Reading errors made by skilled and unskilled readers: evaluating a system that generates reports for people with poor literacy. University of Aberdeen Department of Computing Science Technical Report AUCS/TR0407, pages 1-6. pdf (123KB)
- Sandra Williams  Natural Language Generation (NLG) of discourse relations for different reading levels. PhD Thesis, University of Aberdeen.
- Sandra Williams  Language choice models for microplanning and readability. Proceedings of the Student Workshop of the Human Language Technology and North American Chapter of the Association for Computational Linguistics Conference (HLT-NAACL03 Student Workshop), Edmonton, pp. 13-18, May 2003. pdf (62KB)
- Sandra Williams, Ehud Reiter and Liesl Osman  Experiments with discourse-level choices and readability. Proceedings of the 9th European Workshop on Natural Language Generation, Budapest, pp. 127-134, April 2003. pdf (144KB)
- Ehud Reiter, Somayajulu Sripada and Sandra Williams  Acquiring and Using Limited User Models in NLG. Proceedings of the 9th European Workshop on Natural Language Generation, Budapest, pp. 87-94, April 2003. pdf (105KB)
- Sandra Williams and Ehud Reiter  A corpus analysis of discourse relations for Natural Language Generation. Proceedings of Corpus Linguistics 2003, pp. 899-908, Lancaster University, March 2003. pdf (277KB)
- Sandra Williams  Natural language generation of discourse connectives for different reading levels. The UK Special Interest Group for Computational Linguistics, 5th Annual CLUK Research Colloquium, Leeds. pdf (248KB)
- Sandra Williams and Catherine I. Watson  A Profile of the Discourse and Intonational Structures of Route Descriptions. Proceedings of the 6th European Conference on Speech Communication and Technology, Eurospeech’99, September 5-9, 1999, Budapest, Hungary, Volume 4, pp. 1659-1662. pdf (43KB)
- Sandra Williams  Generating Pitch Accents in a Concept-To-Speech System Using a Knowledge Base. Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP’98), Volume 4, pp. 1159-1162, Sydney, Australia, 30th November – 4th December 1998. pdf (126KB)
- Robert Dale, Stephen Green, Maria Milosavljevic, Cécile Paris, Cornelia Verspoor and Sandra Williams  Dynamic Document Delivery: Generating Natural Language Texts on Demand. 9th International Conference and Workshop on Database and Expert Systems Applications. August 24-28, Vienna, Austria. pdf (325KB)
- Robert Dale, Stephen Green, Maria Milosavljevic, Cécile Paris, Cornelia Verspoor and Sandra Williams  Using Natural Language Generation Techniques to Produce Virtual Documents. Proceedings of the Third Australian Document Computing Symposium, August 21st, Sydney, Australia. pdf (477KB)
- Robert Dale, Stephen Green, Maria Milosavljevic, Cécile Paris, Cornelia Verspoor and Sandra Williams  The Realities of Generating Natural Language from Databases. 11th Australian Joint Conference on Artificial Intelligence, 12-17 July, Brisbane, Australia. pdf (444KB)
- Cornelia Verspoor, Robert Dale, Stephen Green, Maria Milosavljevic, Cécile Paris, and Sandra Williams  Intelligent Agents for Information Presentation: Dynamic Description of Knowledge Base Objects. In Proceedings of the International Workshop on Intelligent Agent on the Internet and Web, Mexico City, Mexico, 16-20 March 1998, pp. 75-86. pdf (404KB)
- Sandra Williams, Mark Harvey and Keith Preston  Rule-based reference resolution for unrestricted text using part-of-speech tagging and noun phrase parsing. Discourse Anaphora and Anaphor Resolution Colloquium (DAARC), Lancaster, U.K., July 1996. pdf (93KB)
- Sandra Williams  Anaphoric reference and ellipsis resolution in a telephone-based spoken language system for accessing email. Discourse Anaphora and Anaphor Resolution Colloquium (DAARC), Lancaster, U.K., July 1996. Also in Simon Botley and Anthony McEnery (eds.) Corpus-based and Computational Approaches to Discourse Anaphora, John Benjamins Publishing Company, ISBN 902722272X, 2000. pdf (73KB)
- Sandra Williams  Dialogue management in a mixed-initiative, cooperative, spoken language system. 11th Twente Workshop on Language Technology (TWLT11) Dialogue Management in Natural Language Systems, Enschede, Netherlands, June 1996. pdf (101KB)
- Peter Wyard, Alison Simons, Steve Appleby, Edward Kaneen, Sandra Williams and Keith Preston  Spoken Language Systems. BT Technology Journal, January 1996. pdf (444KB)
- Peter Wyard, Steven Appleby, Edward Kaneen, Sandra Williams and Keith Preston  A Combined Speech and Visual Interface to the BT Business Catalogue. ESCA Workshop on Spoken Dialogue Systems, 30th May – 2nd June 1995
- Keith Preston and Sandra Williams  Managing the Information Overload. Physics in Business, Institute of Physics, June 1994 pdf (20KB)