PhD on Generative AI (KG-enhanced LLMs)

PhD on Generative AI (KG-enhanced LLMs)

```html

Are you eager to work on a combination of Large Language Models (LLMs) with Knowledge Graphs (KGs) to create trustworthy conversational AI? Do you want to have an impact on the world’s supplier to the semiconductor industry (ASML)?

Position: PhD student

Irène Curie Fellowship: No

Department(s): Mathematics and Computer Science

FTE: 1.0

Date off: 29/09/2024

Reference number: V32.7705

Job Description

This is a 4-year paid PhD position. The position will be with the Data and AI cluster at the Eindhoven University of Technology (TU/e) and ASML:

  • In the Data and AI cluster, we study the foundations of data and AI for the present and the future. We design new methods, develop algorithms and tools with a view to expanding the reach of databases and AI and their generalization abilities. In particular, we study foundational issues of robustness, safety, fairness, trust, reliability, tractability, scalability, interpretability, and explainability of data and AI. Currently, DAI includes five research groups: Uncertainty in AI, Generative AI, Automated ML, Data Mining, and Databases.
  • ASML, a leader in semiconductor manufacturing, faces challenges with limited and unbalanced data in metrology and diagnostics for their photolithography machines. Traditional approaches struggle with such data constraints. To address this, ASML explores foundation models, robust and adaptable models trained on extensive datasets. These models can effectively utilize small amounts of proprietary data, enhancing metrology and diagnostics accuracy. This innovation aligns with ASML's commitment to improving semiconductor manufacturing. By leveraging advanced machine learning techniques, ASML aims to optimize chip production, leading to higher yields and superior quality.

You will be supervised by Dr. J.M. Tomczak (TU/e), Prof. M. Pechenizkiy (TU/e), Prof. G. Fletcher (TU/e), and Dr. J. Kustra (ASML). You will be working in close collaboration with the Diagnostics & Data Science Group in ASML Research. This multidisciplinary team focuses on fundamentally exploring and prototyping the next generation knowledge-informed solutions for ASML, Metrology and Lithography challenges. Given the system complexity, a core challenge is in the diagnostics of (rarely occurring) failures, where the existing knowledge on system design is brought together with physics understanding as well as system data to reason on the problem potential root causes. You will participate in cutting-edge research, publish your work in leading conferences (NeurIPS, ICML, ICLR, AISTATS, UAI) and journals (TML, IEEE TPAMI, JMLR), and contribute to open-source tools.

You will work on developing a framework that will assist engineers in their diagnostics work and, consequently, shorten the downtime of a system. Additionally, the following assumptions are considered: (i) the framework must be conversational, i.e., an engineer must be able to check facts and procedures quickly, (ii) the framework must be trustworthy, namely, it cannot “hallucinate”.

We propose to formulate KG-enhanced LLMs that could serve for training, inference, and interpretability. LLMs are well-known for knowledge acquisition from large-scale systems and for achieving state-of-the-art performance on many natural language processing tasks. However, they can suffer from various issues, such as hallucinations, false references, and made-up facts. On the other hand, KGs can store enormous amounts of facts in a structured and explicit manner. However, unlike LLMs, formulating KGs is a laborious process, and querying KGs might be computationally demanding. One interesting research question is then the following: How to combine KGs and LLMs such that LLMs provide answers based on facts and do not hallucinate in any way? This could serve as a starting point for this Ph.D. project.

Job Requirements

  • BSc and MSc degree in Computer Science, Mathematics, or a closely related field.
  • Good statistical background and knowledge of probability theory, good understanding of Machine Learning and Deep Learning.
  • Programming in Python and PyTorch/Jax.
  • Fluent in spoken and written English, ideally demonstrated by tests (e.g., IELTS/TOEFL).
  • Ability to read scientific papers.
  • Ability to work in an interdisciplinary team and interested in collaborating with the industrial partner (ASML).

Conditions of Employment

A meaningful job in a dynamic and ambitious university, in an interdisciplinary setting and within an international network. You will work on a beautiful, green campus within walking distance of the central train station. In addition, we offer you:

  • Full-time employment for four years, with an intermediate evaluation (go/no-go) after nine months. You will spend 10% of your employment on teaching tasks.
  • Salary and benefits (such as a pension scheme, paid pregnancy and maternity leave, partially paid parental leave) in accordance with the Collective Labour Agreement for Dutch Universities, scale P (min. €2,872 max. €3,670).
  • A year-end bonus of 8.3% and annual vacation pay of 8%.
  • High-quality training programs and other support to grow into a self-aware, autonomous scientific researcher. At TU/e we challenge you to take charge of your own learning process.
  • An excellent technical infrastructure, on-campus children's day care and sports facilities.
  • An allowance for commuting, working from home and internet costs.
  • A Staff Immigration Team and a tax compensation scheme (the 30% facility) for international candidates.

Information and Application

About Us

Eindhoven University of Technology is an internationally top-ranking university in the Netherlands that combines scientific curiosity with a hands-on attitude. Our spirit of collaboration translates into an open culture and a top-five position in collaborating with advanced industries. Fundamental knowledge enables us to design solutions for the highly complex problems of today and tomorrow.

Curious to hear more about what it’s like as a PhD candidate at TU/e? Please view the video.

Information

Are you inspired and would like to know more about working at TU/e? Please visit our career page.

Do you recognize yourself in this profile and would you like to know more? Visit our website for more information about the application process or the conditions of employment. You can also contact Dr. J.M. Tomczak (j.m.tomczak@tue.nl), Prof. M. Pechenizkiy (m.pechenizkiy@tue.nl), Prof. G. Fletcher (g.fletcher@tue.nl), and Dr. J. Kustra (jacek.kustra@asml.com)

Application

We invite you to submit a complete application by using the apply button. The application should include a:

  • CV.
  • Detailed grades.
  • A cover letter (1 page max.) that briefly describes your motivation for your application, and what makes you a good candidate for this position. Make sure that your cover letter also mentions your grades for the courses you consider relevant to this position.
  • Contact details of two referees (incl. your supervisor), who may be contacted for more information.

We look forward to receiving your application and will screen it as soon as possible. The vacancy will remain open until the position is filled.

```

Job Overview

PhD on Generative AI (KG-enhanced LLMs)