How to Use RFdiffusion Online

Try RFdiffusion

Commercially Available Online Web Server

RFDiffusion: A Generative Model for Protein Design

RFDiffusion, or RoseTTAFold diffusion, is a powerful new method for protein structure design that combines a generative diffusion model with a fine-tuned structure prediction network. Unlike previous approaches that often relied on manually specified "blueprints" or struggled to produce diverse and stable structures, RFDiffusion can generate a wide range of new proteins from simple molecular specifications.

The method was developed by fine-tuning the RoseTTAFold network on protein structure denoising tasks. This approach overcomes the limitations of earlier deep learning methods, which had limited success in generating sequences that fold to the intended structures. RFDiffusion has achieved outstanding performance in various design challenges, including:

Unconditional protein generation
Protein binder design
Symmetric assemblies and oligomers
Enzyme active site scaffolding

The accuracy and utility of RFDiffusion have been experimentally confirmed by characterizing the structures and functions of hundreds of designed proteins. The cryogenic electron microscopy (cryo-EM) structure of a designed binder was shown to be nearly identical to its computational model, validating the method's high precision.

How RFDiffusion Works

RFDiffusion operates on a denoising diffusion probabilistic model (DDPM), similar to those used for generating images from text. The process involves two main steps:

Forward Process (Noising): This is a stochastic process that gradually adds noise to known protein structures from a database like the Protein Data Bank (PDB). This corruption process continues over multiple steps until the original protein structure is transformed into pure Gaussian noise. The model works on a rigid-frame representation of protein residues, which includes their Cα coordinates and N-Cα-C orientations.
Reverse Process (Denoising): This is the core of the generative model. The RFDiffusion network is trained to learn how to reverse the forward process by predicting and removing the noise added at each step. Starting from a state of random noise, the model iteratively de-noises the structure, progressively refining it until it becomes a realistic protein backbone. The denoising network is a fine-tuned version of the RoseTTAFold structure prediction model, which provides a deep understanding of protein sequence and structure relationships.

The output is a diverse set of protein structures that are not limited by existing protein folds but are still physically plausible and stable. The model can also be "guided" with design specifications, such as a specific motif or target, to generate new proteins with desired functions.

Comparing RFdiffusion2 & RFdiffusion

	RFdiffusion	RFdiffusion2
Active Site Specification	Required specification at the residue level, often requiring complex inverse rotamer calculations.	Allows for atomic-level or partial ligand specification, eliminating the need for complex calculations.
Design Success Rate (AME Benchmark)	Successful in generating scaffolds for only 16 out of 41 diverse enzyme active sites.	Successfully generated scaffolds for all 41 sites, achieving a 100% success rate.
Flexibility & Control	More rigid design, requiring pre-specified sequence indices for catalytic residues.	Sequence-agnostic design with added features like stochastic centering and partial ligand specification for greater control.
Training & Architecture	Based on the original RFdiffusion framework.	Utilizes a more efficient flow matching training framework and the RoseTTAFold All-Atom neural network architecture to model side chain postures directly.

We encourage you to test both models out to see which works best for your specific experiment.

What is Tamarind Bio?

Tamarind Bio is a pioneering no-code bioinformatics platform built to democratize access to powerful computational tools for life scientists and researchers. Recognizing that many cutting-edge machine learning models are often difficult to deploy and use, Tamarind provides an intuitive, web-based environment that completely abstracts away the complexities of high-performance computing, software dependencies, and command-line interfaces.

The platform is designed provide easy access to biologists, chemists, and other researchers who may not have a background in programming or cloud infrastructure but want to run experimental models with their data. Key features include a user-friendly graphical interface for setting up and launching experiments, a robust API for integration into existing research pipelines, and an automated system for managing and scaling computational resources. By handling the technical heavy lifting, Tamarind empowers researchers to concentrate on their scientific questions and accelerate the pace of discovery. The Tamarind team hold information/data security as a top priority, as detailed in our Trust Center & Terms of Service, ensuring your data is safe on the platform.

Accelerating Discovery with RFDiffusion on Tamarind Bio

The integration of RFDiffusion's advanced generative capabilities with Tamarind.bio's user-centric platform creates a powerful synergy that can significantly accelerate the drug discovery and research process. This combination allows researchers to leverage RFDiffusion's strengths in an environment that is both simple to use and highly scalable.

Diverse and Reproducible Protein Designs: RFDiffusion generates a wide variety of high-quality protein backbones, allowing researchers to explore novel structural space beyond what is known in nature. This is a major advantage over traditional methods, which are often limited to a single solution.
Rapid Iteration and Screening: By abstracting away the computational and technical complexities, Tamarind.bio allows for rapid iteration of protein design. This empowers researchers to quickly design, test, and refine their protein candidates, drastically reducing the time and cost associated with traditional protein engineering workflows.
Democratizing Protein Design: The no-code interface and scalable cloud infrastructure make powerful tools like RFDiffusion accessible to a broader scientific community, including researchers in academia and small biotech companies who might not have access to dedicated computational resources.

In essence, Tamarind.bio and RFDiffusion together provide a comprehensive, end-to-end solution for de novo protein design that is fast, accurate, and accessible, empowering a new generation of scientists to make groundbreaking discoveries.

How to Use RFDiffusion on Tamarind Bio

Tamarind Bio makes using RFDiffusion straightforward and efficient, regardless of your technical expertise. The no-code platform streamlines the entire workflow for de novo protein design.

Here is a simple, step-by-step guide for researchers to get started:

Access the Platform: Begin by logging in to the tamarind.bio website.
Select RFDiffusion: From the list of available computational models, choose RFDiffusion.
Specify Design Goals: Instead of writing complex code, you'll use a user-friendly interface to specify your design goals. This could be as simple as defining the desired length of a protein, or more complex, such as providing a motif for the new protein to bind to (motif scaffolding). The platform handles all necessary file conversions and preprocessing automatically.
Submit and Monitor: With the click of a button, you can submit your job. The Tamarind.bio platform then handles the allocation of powerful GPU resources and executes the RFDiffusion simulation. You can monitor the progress of your job directly from the dashboard, receiving real-time updates without needing to check the command line.
Analyze the Results: Once the job is complete, you will receive a comprehensive report. The results include a diverse set of novel protein designs that can be further analyzed to select the best candidates for experimental testing. The platform also provides a convenient way to manage these designs and their associated data.

Source

Supporting 10,000+ scientists around the world,

from leading biotechs, and global biopharma

Get started

Book a demo