How to Use RFdiffusion Online

How to Use RFdiffusion Online

Try RFdiffusion

Commercially Available

RFDiffusion: A Generative Model for Protein Design

RFDiffusion, or RoseTTAFold diffusion, is a powerful new method for protein structure design that combines a generative diffusion model with a fine-tuned structure prediction network. Unlike previous approaches that often relied on manually specified "blueprints" or struggled to produce diverse and stable structures, RFDiffusion can generate a wide range of new proteins from simple molecular specifications.

The method was developed by fine-tuning the RoseTTAFold network on protein structure denoising tasks. This approach overcomes the limitations of earlier deep learning methods, which had limited success in generating sequences that fold to the intended structures. RFDiffusion has achieved outstanding performance in various design challenges, including:

  • Unconditional protein generation

  • Protein binder design

  • Symmetric assemblies and oligomers

  • Enzyme active site scaffolding

The accuracy and utility of RFDiffusion have been experimentally confirmed by characterizing the structures and functions of hundreds of designed proteins. The cryogenic electron microscopy (cryo-EM) structure of a designed binder was shown to be nearly identical to its computational model, validating the method's high precision.

How RFDiffusion Works

RFDiffusion operates on a denoising diffusion probabilistic model (DDPM), similar to those used for generating images from text. The process involves two main steps:

  1. Forward Process (Noising): This is a stochastic process that gradually adds noise to known protein structures from a database like the Protein Data Bank (PDB). This corruption process continues over multiple steps until the original protein structure is transformed into pure Gaussian noise. The model works on a rigid-frame representation of protein residues, which includes their Cα coordinates and N-Cα-C orientations.

  2. Reverse Process (Denoising): This is the core of the generative model. The RFDiffusion network is trained to learn how to reverse the forward process by predicting and removing the noise added at each step. Starting from a state of random noise, the model iteratively de-noises the structure, progressively refining it until it becomes a realistic protein backbone. The denoising network is a fine-tuned version of the RoseTTAFold structure prediction model, which provides a deep understanding of protein sequence and structure relationships.

The output is a diverse set of protein structures that are not limited by existing protein folds but are still physically plausible and stable. The model can also be "guided" with design specifications, such as a specific motif or target, to generate new proteins with desired functions.

What is Tamarind Bio?

Tamarind.bio is a pioneering no-code bioinformatics platform built to democratize access to powerful computational tools for life scientists and researchers. Recognizing that many cutting-edge machine learning models are often difficult to deploy and use, Tamarind Bio provides an intuitive, web-based environment that completely abstracts away the complexities of high-performance computing, software dependencies, and command-line interfaces.

The platform is designed to be accessible to biologists, chemists, and other researchers who may not have a background in programming or cloud infrastructure. Key features include a user-friendly graphical interface for setting up and launching experiments, a robust API for integration into existing research pipelines, and an automated system for managing and scaling computational resources. By handling the technical heavy lifting, Tamarind.bio empowers researchers to concentrate on their scientific questions and accelerate the pace of discovery.

Accelerating Discovery with RFDiffusion on Tamarind Bio

The integration of RFDiffusion's advanced generative capabilities with Tamarind.bio's user-centric platform creates a powerful synergy that can significantly accelerate the drug discovery and research process. This combination allows researchers to leverage RFDiffusion's strengths in an environment that is both simple to use and highly scalable.

  • Diverse and Reproducible Protein Designs: RFDiffusion generates a wide variety of high-quality protein backbones, allowing researchers to explore novel structural space beyond what is known in nature. This is a major advantage over traditional methods, which are often limited to a single solution.

  • Rapid Iteration and Screening: By abstracting away the computational and technical complexities, Tamarind.bio allows for rapid iteration of protein design. This empowers researchers to quickly design, test, and refine their protein candidates, drastically reducing the time and cost associated with traditional protein engineering workflows.

  • Democratizing Protein Design: The no-code interface and scalable cloud infrastructure make powerful tools like RFDiffusion accessible to a broader scientific community, including researchers in academia and small biotech companies who might not have access to dedicated computational resources.

In essence, Tamarind.bio and RFDiffusion together provide a comprehensive, end-to-end solution for de novo protein design that is fast, accurate, and accessible, empowering a new generation of scientists to make groundbreaking discoveries.

How to Use RFDiffusion on Tamarind Bio

Tamarind Bio makes using RFDiffusion straightforward and efficient, regardless of your technical expertise. The no-code platform streamlines the entire workflow for de novo protein design.

Here is a simple, step-by-step guide for researchers to get started:

  1. Access the Platform: Begin by logging in to the tamarind.bio website.

  2. Select RFDiffusion: From the list of available computational models, choose RFDiffusion.

  3. Specify Design Goals: Instead of writing complex code, you'll use a user-friendly interface to specify your design goals. This could be as simple as defining the desired length of a protein, or more complex, such as providing a motif for the new protein to bind to (motif scaffolding). The platform handles all necessary file conversions and preprocessing automatically.

  4. Submit and Monitor: With the click of a button, you can submit your job. The Tamarind.bio platform then handles the allocation of powerful GPU resources and executes the RFDiffusion simulation. You can monitor the progress of your job directly from the dashboard, receiving real-time updates without needing to check the command line.

  5. Analyze the Results: Once the job is complete, you will receive a comprehensive report. The results include a diverse set of novel protein designs that can be further analyzed to select the best candidates for experimental testing. The platform also provides a convenient way to manage these designs and their associated data.

Empowering 10,000+ scientists around the world

from academia to startups to large pharma & therapeutics

Connect to Content

Add layers or components to infinitely loop on your page.

About Tamarind Bio

Provides a no-code, web-based platform and API that gives life scientists and researchers access to powerful AI and computational tools. Their services are designed to simplify complex bioinformatics tasks, particularly in structural biology, protein design, and drug discovery.

Tamarind Bio's Mission

Arm scientists with the tools to invent the future of biology and enable creative discovery.