PhD on contextualizing generative AI systems in structured data (m/f/x)

PhD on contextualizing generative AI systems in structured data (m/f/x)

Published Deadline Location
27 Jun 20 Jul Amsterdam

You cannot apply for this job anymore (deadline was 20 Jul 2024).

Browse the current job offers or choose an item in the top navigation above.

Centrum Wiskunde & Informatica (CWI) has a vacancy in the Database Architectures research
group for a talented

Job description

Interested in making representation and generative learning work with structured data (e.g. tables
in spreadsheets and databases) to accurately, securely, and efficiently, democratize insights from
data? This 4-year PhD starting September 2024 is for you!

Goal of the DataLibra project
Approximately 120 zettabytes of data has been collected worldwide but less than 1% is actually
used. Structured data, e.g. tables, spreadsheets, and relational databases, is prevailing in
organizations and typically informs important decisions in healthcare, governments and finance.
Yet, while AI has demonstrated a high impact on applications on text and images, proportional
progress on structured data is lacking. With the DataLibra project, we aim to close this gap, by
developing AI models and tools for structured data (Table Representation Learning), to help
organizations, of any size, domain, and level of data literacy, get insights from structured data,
efficiently, accurately and securely.

Goal of this PhD project
Following recent developments in AI, large language models (LLMs) have been explored for data
analytics tasks (e.g. text-to-sql), but show limited accuracy in domain-specific contexts with
structured data. In this project, we will investigate and design interactive intelligent systems for
data analytics tasks, while accounting for two key challenges: trustworthiness of the outputs
(factuality), and security constraints of proprietary data contexts as in healthcare, enterprises, and
governments. Potential directions to explore are agentic systems, retrieval augmented generation,
(instruction) fine-tuning, and others.

What you will be doing
 Inform a research agenda on the PhD topic for a timespan of four years.
 Develop methods and systems for contextualizing generative AI for analytics over
structured data.
 Actively collaborate with other researchers in the DataLibra project (students, 4-5 PhDs,
postdocs, PI) and external collaborators (e.g. Amsterdam UMC, University of Amsterdam).
 Communicate research outcomes through papers and presentations at conferences,
workshops and other (scientific) gatherings.
 Assist in relevant teaching activities at universities, such as thesis supervision and assisting
in courses.

Specifications

Centrum Wiskunde en Informatica (CWI)

Requirements

  • A master (or equivalent) in computer science, artificial intelligence or similar.
  • Self-drive, creativity, rigor, ownership, and excitement to push research in TRL forward.
  • Theoretical knowledge of, or experience with, data management, human-computer

interaction, and/or machine learning such as representation and generative learning.
  • Programming skills, e.g. Python, Java, or C++.
  • Excellent command in English, verbal and written.
  • Prior experience as a research assistant during (under)graduate studies is advantageous

Diversity code
CWI encourages a diverse workforce: we endeavour to develop talent and creativity by bringing people from different backgrounds and cultures together. We recruit and select based on capabilities and talent. We strongly encourage everyone with the appropriate qualifications to apply for the vacancy, regardless of age, gender, origin, sexual orientation or physical abilities.

Conditions of employment

The terms of employment are in accordance with the Dutch Collective Labour Agreement for Research Centres ("CAO-onderzoeksinstellingen"). The initial labour agreement will be for a period of 18 months. After a positive evaluation, the agreement will be extended by 30 months. The gross monthly salary, for a PhD student on a full time basis, is € 2,781 during the first year and increases to € 3,562 over the four year period. Employees are also entitled to a holiday allowance of 8% of the gross annual salary and a year-end bonus of 8.33%. CWI offers attractive working conditions, including flexible scheduling and help with housing for expat employees.

Please visit our websites for more information about our terms of employment:
https://www.cwi.nl/jobs/terms-of-employment and
https://www.nwo-i.nl/en/working-at-nwo-i/jobsatnwoi/

Employer

Centrum Wiskunde & Informatica

Centrum Wiskunde & Informatica (CWI) is the Dutch national research institute for mathematics and computer science and is part of the Institutes Organisation of the Dutch Research Council (NWO). The mission of CWI is to conduct pioneering research in mathematics and computer science, generating new knowledge in these fields and conveying it to trade, industry, and society at large.

CWI is an internationally oriented institute, with 160 scientists from approximately 27 countries, an informal atmosphere and short lines of communication. We have an activity committee that organizes after-work activities and an informal women’s network.

CWI is located at Science Park Amsterdam, the home of AMS-IX, that is presently developing into a major location of research in the physical sciences in The Netherlands, housing the sciences of the University of Amsterdam as well as several other national research institutes next to CWI.

Specifications

  • PhD; PhD scholarship
  • Natural sciences
  • €2407—€3562 per month
  • Higher professional education
  • 3484

Employer

Centrum Wiskunde en Informatica (CWI)

Learn more about this employer

Location

Science Park 123, 1098XG, Amsterdam

View on Google Maps

Interesting for you