PhD on Generative AI (KG-enhanced LLMs)

PhD on Generative AI (KG-enhanced LLMs)

Published Deadline Location
30 Aug 29 Sep Eindhoven

You cannot apply for this job anymore (deadline was 29 Sep 2024).

Browse the current job offers or choose an item in the top navigation above.

Are you eager to work on a combination of Large Language Models (LLMs) with Knowledge Graphs (KGs) to create trustworthy conversational AI?

Do you want to have an impact on the world’s supplier to the semiconductor industry (ASML)?

Job description

This is a 4-year paid PhD position. The position will be with the Data and AI cluster at the Eindhoven Univ. of Technology (TU/e) and ASML:
  • In the Data and AI cluster, we study foundations of data and AI for the present and the future. We design new methods, develop algorithms and tools with a view at expanding the reach of databases and AI and their generalization abilities. In particular, we study foundational issues of robustness, safety, fairness, trust, reliability, tractability, scalability, interpretability and explainability of data and AI. Currently, DAI includes five research groups: Uncertainty in AI, Generative AI, Automated ML, Data Mining, and Databases.
  • ASML, a leader in semiconductor manufacturing, faces challenges with limited and unbalanced data in metrology and diagnostics for their photolithography machines. Traditional approaches struggle with such data constraints. To address this, ASML explores foundation models, robust and adaptable models trained on extensive datasets. These models can effectively utilize small amounts of proprietary data, enhancing metrology and diagnostics accuracy. This innovation aligns with ASML's commitment to improving semiconductor manufacturing. By leveraging advanced machine learning techniques, ASML aims to optimize chip production, leading to higher yields and superior quality.

You will be supervised by Dr. J.M. Tomczak (TU/e), Prof. M. Pechenizkiy (TU/e), Prof. G. Fletcher (TU/e), and Dr. J. Kustra (ASML). You will be working in close collaboration with the Diagnostics & Data Science Group in ASML Research. This multidisciplinary team focuses on fundamentally exploring and prototyping the next generation knowledge-informed solutions for ASML, Metrology and Lithography challenges. Given the system complexity, a core challenge is in the diagnostics of (rarely occurring) failures, where the existing knowledge on system design is brought together with physics understanding as well as system data to reason on the problem potential root causes. You will participate in cutting-edge research, publish your work in leading conferences (NeurIPS, ICML, ICLR, AISTATS, UAI) and journals (TML, IEEE TPAMI, JMLR), and contribute to open-source tools.

You will work on developing a framework that will assist engineers in their diagnostics work and, consequently, shorten the downtime of a system. Additionally, the following assumptions are considered: (i) the framework must be conversational, i.e., an engineer must be able to check facts and procedures quickly, (ii) the framework must be trustworthy, namely, it cannot 'hallucinate'.

We propose to formulate KG-enhanced LLMs that could serve for training, inference, and interpretability. LLMs are well-known for knowledge acquisition from large-scale systems and for achieving state-of-the-art performance on many natural language processing tasks. However, they can suffer from various issues, such as hallucinations, false references, and made-up facts. On the other hand, KGs can store enormous amounts of facts in a structured and explicit manner. However, unlike LLMs, formulating KGs is a laborious process, and querying KGs might be computationally demanding. One interesting research question is then the following: How to combine KGs and LLMs such that LLMs provide answers based on facts and do not hallucinate in any way? This could serve as a starting point for this Ph.D. project.

Specifications

Eindhoven University of Technology (TU/e)

Requirements

  • BSc and MSc degree in Computer Science, Mathematics, or a closely related field.
  • Good statistical background and knowledge of probability theory, good understanding of Machine Learning and Deep Learning.
  • Programming in Python and PyTorch/Jax.
  • Fluent in spoken and written English, ideally demonstrated by tests (e.g., IELTS/TOEFL).
  • Ability to read scientific papers.
  • Ability to work in an interdisciplinary team and interested in collaborating with the industrial partner (ASML).

Conditions of employment

A meaningful job in a dynamic and ambitious university, in an interdisciplinary setting and within an international network. You will work on a beautiful, green campus within walking distance of the central train station. In addition, we offer you:
  • Full-time employment for four years, with an intermediate evaluation (go/no-go) after nine months. You will spend 10% of your employment on teaching tasks.
  • Salary and benefits (such as a pension scheme, paid pregnancy and maternity leave, partially paid parental leave) in accordance with the Collective Labour Agreement for Dutch Universities, scale P (min. €2,872 max. €3,670).
  • A year-end bonus of 8.3% and annual vacation pay of 8%.
  • High-quality training programs and other support to grow into a self-aware, autonomous scientific researcher. At TU/e we challenge you to take charge of your own learning process.
  • An excellent technical infrastructure, on-campus children's day care and sports facilities.
  • An allowance for commuting, working from home and internet costs.
  • Staff Immigration Team and a tax compensation scheme (the 30% facility) for international candidates. 

Specifications

  • PhD
  • Engineering
  • max. 38 hours per week
  • University graduate
  • V32.7705

Employer

Eindhoven University of Technology (TU/e)

Learn more about this employer

Location

De Rondom 70, 5612 AP, Eindhoven

View on Google Maps

Interesting for you