Description

The field of structural bioinformatics is shifting towards generating artificial proteins and designing them specifically for certain target functions. In this seminar, we will dive into the topics that awarded David Baker a Nobel Prize in Chemistry in 2024 (among Demis Hassabis and John Jumper for their contributions to protein structure prediction).

We will cover the latest advances in protein generation and design, with a focus on deep learning-based methods. The seminar will be structured as a block seminar in late September/early October, where students will present recent papers on the topic and work on a hands-on project related to their presentation topic.

✦ ✦ ✦

Requirements

We have no strict prerequisites but will prefer students who took at least one of the following courses:

Team

What Do You Need To Do?

How Are the Grades Computed?

Presentation
40%
Hands-On
40%
Participation
20%

Presentation (40%)

Assessed on clarity, depth of understanding, and the quality of your answers to audience questions.

Hands-On (40%)

You will try to reproduce a figure from the assigned paper. This will be individually discussed with the supervisor, based on the paper. Your submission should include the following two components:

Participation (20%)

Active engagement during other students' presentations. Ask questions — it counts toward your grade and improves the seminar for everyone.

Registration

This seminar is open to all students from computer science and bioinformatics. To apply, please write an email to roman.joeres@helmholtz-hips.de before May, 3rd 23:59. Please attach your current transcript of records (available via LSF/HISPOS).

After the registration deadline, all applicants will be informed of their participation status by email. We will then find a date for the kick-off meeting, where we will assign the presentation topics and discuss organizational details.

Organisational Details

📅 The seminar will be held as a block seminar in late September/early October.

🎓 Successful participants will earn 7 credit points (CP).

👥 Maximum number of participants: 12. 4 CS students, 8 bioinformatics students.

🌐 Seminar language: English.

📋 Registration in LSF/HISPOS is due on 02.06.2026.

For questions, contact roman.joeres@helmholtz-hips.de.

Important Dates

03.05.2026, 23:59 Registration deadline
12.05.2026, 15:00 Kick-off meeting — room 0.01, building ZBI (E2.1)
[DD.MM.YYYY, HH:MM] Presentation day(s)

Topics

  1. ProtGPT2 is a deep unsupervised language model for protein design Assigned to: Deekshitha Poobalan, Supervisor: Qingyuan Liu
  2. All-atom protein generation with latent diffusion Assigned to: Klea Sinjari, Supervisor: Qingyuan Liu
  3. Understanding protein function with a multimodal retrieval-augmented foundational model Assigned to: Nicolas Pham, Supervisor: Qingyuan Liu
  4. PB-GPT: An innovative GPT-based model for protein backbone generation Assigned to: Bhavyashree Vishwanatha, Supervisor: Roman Joeres
  5. Zero-shot prediction of mutation effects with mutlimodal deep representation learning guides protein engineering Assigned to: Malik Saad Wazir, Supervisor: Roman Joeres
  6. Structure-informed language models are protein designers Assigned to: Mariia Landau, Supervisor: Anastasiia Kolchina
  7. ProteinBERT: a universal deep-learning model of protein sequence and function Assigned to: Ioannis Mamalis, Supervisor: Anastasiia Kolchina
  8. MULAN: multimodal protein language model for sequence and structure encoding Assigned to: Ben Sievers, Supervisor: Anastasiia Kolchina
  9. Scaling unlocks broader generation and deeper functional understanding of proteins Assigned to: Ibrahim Nadiyan, Supervisor: Roman Joeres
  10. Simulating 500 million years of evolution with a language model Assigned to: Regina Nitsch, Supervisor: Hanqing Liu
  11. Robust deep learning-based protein sequence design using Protein MPNN Assigned to: Susan Maria Bino, Supervisor: Hanqing Liu
  12. Atomic context-conditioned protein sequence design using LigandMPNN Assigned to: Aya Ahmed, Supervisor: Hanqing Liu