Rosetta Commons Research Experience for Undergraduates (REU)

AI for Biomolecular Structure Prediction and Design

Interns in this geographically-distributed REU program participate in research using the Rosetta Commons software. The Rosetta Commons software library includes algorithms for computational modeling and analysis of protein structures. It has enabled notable scientific advances in computational biology, including de novo protein design, enzyme design, ligand docking, and structure prediction of biological macromolecules and macromolecular complexes.

The REU program is pending renewal, with a decision anticipated for the 2026 award year.

Apply Here!

Application Guide

FAQ

"I gained a great mentor in my PI as well as good relationships with the rest of the lab members. I also gained lots of computational skills and professional connections through the Rosetta Commons and my cohort."

"Before the program I was only planning to apply to master's programs in biotech and comp bio however the internship has made me change into applying towards a PhD program."

"I was very comfortable speaking and interacting with the Rosetta community, which was a personal gain for me because I struggle with aspects of verbal communication and often get clammed up/stay quiet."

"I really, really enjoyed working with my mentor. He provided an ideal level of structured independence, in which I was allowed to ask and pursue my own questions while still receiving feedback on how to go about answering them and being able to discuss potential biological interpretations or implications of the results in an open environment.

"I never considered doing research seriously previously, but I really enjoyed being in the lab environment so I decided it may be a worthwhile future path."

“The research experience helped me understand what I want to do in the future. Particularly meeting experts in the field during the conference was a major reason for this decision.”

“I feel more confident in my abilities to try out different fields of research and be successful in them. I also feel more secure in my abilities to communicate my science to others who are not as experienced as well as asking questions to those considered experts in the field.”

"I gained much more confidence in my technical computational skills and ability to work independently. As I was posted in Europe, I also took the opportunity to travel independently which increased my autonomy and self-confidence significantly. This was a life changing experience and am beyond grateful to participate!”

The Program Eligibility Application

The Program

Program Dates: June 1, 2026 - August 7, 2026
The program starts with one week of Rosetta Code School (June 2 through June 6), where you will learn the inner details of the PyRosetta code and community coding environment, so you are fully prepared for the summer!
8 weeks of hands-on research in a molecular modeling and design laboratory, developing new algorithms and discovering new science.
The summer will finish with a trip to the Rosetta Conference in the gorgeous Cascade Mountains of Washington State, where you will present your research in a poster and connect with Rosetta developers from around the world. The conference will be held from August 5 through August 8.
This program is supported by NSF (Award 2244288). Interns will receive housing, paid travel expenses, a sustenance allowance and a $6,000 stipend.

Eligibility

U.S. citizens, permanent residents, and U.S. nationals, are eligible to apply.
International students, who are actively pursuing a bachelor’s degree in the United States, are eligible to apply.
International students studying outside of the US are not eligible to apply.
College Sophomores or Juniors are preferred.
Major in computer science, engineering, mathematics, chemistry, biology, and/or biophysics
Available during the program dates: June 1, 2026 - August 7, 2026.
- If accepted, students on the quarter system can request to take their final exams early or request have final exams proctored at Johns Hopkins.
Interest in graduate school.
While not required, we seek candidates with some combination of experiences in scientific or academic research, C++/Python/*nix/databases, software engineering, object-oriented programming, and/or collaborative development (git)

**Students graduating before the start of the program are not eligible for the REU and are encouraged to apply to our RaMP Program.

Application

Include the following in the application:
- Resume
- Transcript
- Personal statement—why this internship interests you—brief summary of research and computing experience—why you are an appropriate candidate for the internship.
- Two references (complete the reference forms in the application with contact information)
- Select top five labs and projects of interest from the list below.
Deadline for receipt of applications is February 1, 2026.
Deadline for receipt of recommendation letters is February 4, 2026.
Click here for Creating a Competitive REU Application for help with preparing your application.
Program contact: cmathis@jhu.edu

Available Projects and Locations

Cooper Lab@ Northeastern University in Boston, MA

"Citizen Science and Games for Biochemistry"

We are exploring how citizen science and crowdsourcing through video games can help biochemists with their work. To do this, we have developed the game Foldit, a multiplayer online game that allows players
without previous experience in biochemistry to work on protein folding and design problems. This project will focus on development of game-related aspects to understand and improve the player experience.
Potential projects include virtual reality, dynamic difficulty adjustment, and puzzle generation. Projects may incorporate artificial intelligence and machine learning.

Drew Lab @ University of Illinois at Chicago in Chicago, IL

"Design of protein binders to inhibit and stabilize protein interactions to disrupt viral lifecycles"

Viruses, such as Human Immunodeficiency Virus (HIV), are highly disruptive to society in terms of health care costs and lives. The lifecycle of HIV involves the dynamic packaging and unpackaging of its genome into a capsid of proteins. Over stabilizing the assembly of the capsid proteins is hypothesized to disrupt the proper assembly or disassembly of the capsid/matrix and therefore inhibit viral lifecycle. We propose to utilize deep learning protein design methods to develop high affinity binders targeting an assembly of capsid proteins thereby over stabilizing them.

Gray Lab @ Johns Hopkins University in Baltimore, MD

“Antibody engineering with deep learning”

Antibodies are critical molecules in the immune system and as pharmaceuticals. But many deep learning methods perform weakly on them because of their great diversity and reliance on loop structures, which have fewer data available. In this project you will create or apply new approaches such as language models, diffusion or flow models, toward predicting and designing better antibodies.

Hosseinzadeh Lab @ University of Oregon in Eugene, OR

"Protein engineering with deep learning"

In most hub proteins, such as P53, the region that binds to different proteins lacks a 3D shape, but gets its shape upon binding to different targets: S100B, CPB, SIRT1. When it is cancerous, its affinity changes to different protein targets, depending on which protein it is binding to. We are trying to design inhibitors against targets rather than the P53 itself to study the role of each one of them alone.

Institute for Protein Design @ University of Washington in Seattle, WA

"Protein design using generative models"

Combinations of novel AI design methods and the generative nature of RFdiffusion are converting previously intractable protein modeling and design problems into tasks that scientists can solve within weeks. Students will learn cutting edge deep learning protein design methods, and apply them to current design challenges. Areas of focus will include de novo enzyme design and de novo binder design. By the end of the experience, students will understand the computational pipeline of binder design and protein engineering experimental techniques. They will develop proficiency in molecular modeling, machine learning, and protein structure prediction techniques.

Alternative dates for this location: June 1- June 5 for training and June 21 - August 22

Khare Lab @ Rutgers University in New Brunswick, NJ

"Deep learning guided design of targeted protein editors "

There are more than 20,000 different proteins in or on a human cell. The ability to precisely target and enzymatically modify these proteins in programmed ways could allow unprecedented ability to interrogate and intervene in their biology. Advances in deep learning have enabled the design of binding proteins, particularly to targets with a well defined three dimensional structure. Building on these advances, we are developing methods to design affinity clamps that selectively bind unstructured C-terminal tails of proteins using engineered versions of natural tail binding proteins called PDZ domains. We are also designing protease enzymes with tailored specificity to modify any target protein at a chosen location. These bespoke PDZ-Protease molecules are expected to deliver a "one-two punch" to any target protein of choice. This design blueprint of our protein editors is reminescent of CRISPR technology for genome editing.

Khmelinskaia Lab @ Ludwig Maximilian University of Munich in Munich, Germany

"Expanding the functional space of de novo designed protein assemblies"

Symmetry is widely explored by biological systems, enabling the formation of large complexes from multiple copies of the same protein building block, thus expanding the functional capabilities beyond those of the monomer itself. My lab focuses on the development of protein design methods to expand the structural and functional repertoire of protein assemblies available for synthetic biology and biomedical applications. Our goal is to understand the biophysical rules guiding self-assembly and leverage them to create bioactive, responsive protein-based materials with unprecedented programmability.

Ljubetic Lab @Kemijski Inštitut - National Institute of Chemistry in Ljubljana, Slovenia

" Rigid feet for walkers"

Proteins are nature's nano-machines, driving essential biological processes, including those involving mechanical work: DNA replication, membrane remodelling, and cargo transport. Synthetic protein motors have enormous potential in health and material applications and could help us unravel the mechanism of natural protein motors.
We will combine cutting-edge AI design tools (RFDiffusion, ProteinMPPN, AlphaFold2) with single molecule tracking experiments to produce a completely de novo designed protein walker system with rigid feet. The incorporation of rigid feet enables an asymmetric interaction energy landscape that is a necessary precondition for powered walking. Powered molecular robots will open new frontiers in programmable materials and targeted therapeutic delivery.

Merck Chemical Biotechnologies in Rahway, NJ

Merck Computational and Structural Chemistry in San Francisco, CA

“Predictive modeling and design of cyclic peptides”

In antibody drug discovery, two important goals are to improve antigen binding while reducing antibody self-interactions, and modeling is useful in prioritizing engineering efforts. We have generated large datasets along with homology models and conformational ensembles for each antibody in the dataset. The successful student will leverage Rosetta to generate structure-based descriptors and use them in building predictive machine learning models. The student will work with Merck & Co. scientists to assess the advantages of derived predictions alone and in combination with state-of-the-art predictive approaches.

Merck Discovery Biologics in San Francisco, CA

“Design and engineering of Therapeutic miniproteins”

Miniproteins represent an emerging interesting modality with potential in therapeutic applications. The student will use computational design methods to engineer one or more attributes that are important in therapeutic molecules; affinity, selectivity, stability, solubility...etc. The student will be embedded in the biologics discovery department in a pharmaceutical company and will additionally learn the general processes involved in the discovery of human therapeutics.

Samanta Lab @ University of South Florida

"Physics-based and machine-learning methods to generate ensembles of membrane protein backbones"

The oligomerization of protein macromolecules on cell membranes plays a fundamental role in regulating cellular function. From modulating signal transduction to directing immune response, membrane proteins (MPs) play a crucial role in biological processes and are often the target of many pharmaceutical drugs. Despite their biological relevance, the challenges in experimental determination have hampered the structural availability of membrane proteins and their complexes. Computational docking offers a promising alternative for modeling membrane protein complex structures. The goal of this project is to compare computational tools for capturing the flexibility of membrane protein backbone, which often hinders accurate protein-protein interface predictions.

Schoeder Lab @Leipzig University in Leipzig, Germany

"Design of cellular therapies through computational protein design"

Chimeric antigen receptors (CARs) are the major antigen recognition domain for CAR T cells to detect cancer cells in the tumor microenvironment. The design of the CARs’ affinity and its biophysical properties allow for the fine-tuning of the CAR T cells effector function. Here, we will employ a combination of AI tools and classic biophysical tools to probe and manipulate CAR properties to design the desired affinity and effector function of the CAR T cell. The project can be both conducted as a purely computational project as also as a combined computational and wetlab project.

Strauch Lab @ Washington University at St. Louis in St. Louis, MO

"Protein design using ML methods, high throughput screening for protein properties"

Our lab pursues a range of interdisciplinary projects at the interface of structural biology, immunology, and protein engineering. Current efforts focus on the design of protein-based vaccines, antiviral therapeutics, and molecular machines for gene delivery, as well as the development of immune modulators. These projects integrate computational modeling, protein design, and experimental validation, offering a diverse training environment. Depending on the intern's interests and the current status of ongoing projects, any of these focus areas could form the basis for an REU project. Interns will have the opportunity to contribute to cutting-edge research while learning methods in protein design and engineering, virology, and immunoassay development.

Tandem AI (Remote Project)

"Improving Design and Selection of De Novo Therapeutic Peptides"

Peptide therapeutics are emerging as one of the most promising new drug modalities. This project focuses on improving computational methods for de novo peptide drug design, integrating machine learning, Rosetta modeling, and other physics-based approaches to identify and evaluate novel protein–peptide interfaces. The intern will gain experience in the drug discovery industry, collaborating with a cross-functional team that includes specialists in AI/ML, biophysics, and experimental peptide synthesis and assays. Depending on project needs and interests, potential directions may include designing and evaluating novel peptide candidates or developing tools and workflows that enhance the efficiency and quality of the design process.

Whitehead Lab@ University of Colorado, Boulder in Boulder, CO

"Cracking the human immune repertoire using AI"

We are a (largely) experimental lab with an audacious goal of predicting and designing how antibodies can interact with new molecular surfaces. AI protocols are poorly suited for this problem because antibodies are sufficiently distinct enough from the rest of the proteome. My group is developing and implementing new assays to provide the necessary experimental data for deep learning methods to capture antibody molecular recognition.

Yarov-Yarovoy Lab @ University of California, Davis in Davis, CA

"Design of ion channel modulators"

The overreaching goals of my research are: 1) to rationally design novel subtype-specific ion channel modulators for the treatment of pain, epilepsy, and cardiovascular diseases; 2) to study molecular determinants of ion channel modulation and gating. While we have a solid basis for understanding of the physiological role of ion channel function, the molecular mechanisms of ion channel modulation and gating remain elusive. To gain novel insights into atomic scale structural details of ion channel gating, my group is using computational biology approaches to generate experimentally testable structural hypotheses. My group is leading an effort to develop structural models of ion channels with atomic accuracy using computational modeling approaches and use this knowledge for the rational design of novel ion channel modulators that can be used for the treatment of chronic pain, epilepsy, and cardiac arrhythmias. Related recent papers:

https://pubmed.ncbi.nlm.nih.gov/39463944/

https://pubmed.ncbi.nlm.nih.gov/39189871/

https://pubmed.ncbi.nlm.nih.gov/38445990/

Copy of PXL_20250605_181854807.RAW-01.COVER

Apply Here