The curriculum assumes no prior linguistic or programming knowledge and introduces students to a variety of computational methods and their theoretical underpinnings including: writing programs in Python to process raw texts (tokenization), discovering statistical patterns in linguistic data (frequency distribution), performing part-of-speech tagging, text segmentation, and classification (context-free grammars, dependency grammars), extracting meaning from texts, and applying various machine learning methods to data mining. The Certificate in Computational Linguistics is designed to provide academic training in the study of computational approaches to language analysis.