Centrum Wiskunde & Informatica (CWI) has a vacancy for a
4-year PhD position (m/f/x) on the subject of Fundamental Techniques in Table Representation Learning in collaboration with the University of Amsterdam.
Interested in developing fundamental machine learning techniques for tabular data to democratize insights from high-value structured data? Then this fully-funded 4-year PhD position starting Fall/Winter 2025 is for you!
Goal of the Table Representation Learning (TRL) Lab
Approximately 120 zettabytes of data has been collected worldwide, but less than 1% is actually used. Structured data, as found for example in tables, spreadsheets, and relational databases, is prevalent in organizations and typically informs important decisions in governments, humanitarian organizations, healthcare, and finance. Yet, while AI has demonstrated a high impact on applications involving text and images, proportional progress on tabular data is lacking. With the TRL Lab (Table Representation Learning Lab), we aim to close this gap by developing AI models and tools for tabular data that help organizations of any size, domain, and level of data literacy get insights from structured data efficiently, accurately, and securely.
Goal of this PhD project
High-capacity neural models, such as transformers, have been pivotal for establishing general-purpose models for a wide variety of natural language tasks. Despite successful adaptations for structured data, our research has identified shortcomings with respect to fundamental properties of tabular data. This research position will focus on exploring fundamental techniques for tabular-native models. This can involve, for example, studying new TRL model architectures and serialization and tokenization techniques, among others. A strong interest and background in AI and/or NLP are desired.
What you will be doing
- Inform a research agenda on the PhD topic for a timespan of four years.
- Develop fundamental AI techniques, new TRL models, and systems specific for tabular data.
- Publish reusable software and data artifacts where relevant.
- Communicate research outcomes through papers and talks at conferences, workshops, and beyond.
- Actively collaborate with other researchers in the TRL Lab (students, 4-5 PhDs, postdocs, PI) and external collaborators (e.g. University of Amsterdam, the UN, and Amsterdam UMC).
- Assist in relevant teaching activities at universities, such as thesis supervision and assisting in courses.