How to Use EvoProtGrad Online

Try EvoProtGrad

Commercially Available Online Web Server

EvoProtGrad: Generating Artificial Enzymes with Gradient-Based MCMC

EvoProtGrad (Evolution Proteins Gradient) is a plug & play directed evolution of proteins using a method called gradient-based discrete Markov Chain Monte Carlo (MCMC). This approach provides a computationally efficient way to navigate the massive, discrete space of protein sequences and efficiently discover beneficial mutations. By leveraging gradients from existing neural networks, this method accelerates the generation of functional protein variants, outperforming classical evolutionary algorithms.

How EvoProtGrad Works

The core of this method is a gradient-based discrete MCMC algorithm that is designed to be model-agnostic (Plug & Play).

  1. Black Box Integration: The framework integrates any existing neural network (e.g., AlphaFold-Multimer or another evolutionary model) as a "black box" fitness predictor without requiring fine-tuning or structural information.

  2. Gradient-Guided Search: The MCMC approach uses gradients derived from this external fitness predictor to propose beneficial single-site mutations. This gradient information is key to biasing the random walk toward sequences with higher predicted fitness, which is significantly more efficient than blind MCMC sampling.

  3. Multi-Objective Optimization: The method naturally supports optimizing multiple objectives concurrently. For instance, a researcher can simultaneously guide the evolution of a protein to maximize stability while minimizing immunogenicity using a composite scoring function.

What is Tamarind Bio?

Tamarind Bio is a pioneering no-code bioinformatics platform built to democratize access to powerful computational tools for life scientists and researchers. Recognizing that many cutting-edge machine learning models are often difficult to deploy and use, Tamarind provides an intuitive, web-based environment that completely abstracts away the complexities of high-performance computing, software dependencies, and command-line interfaces.

The platform is designed provide easy access to biologists, chemists, and other researchers who may not have a background in programming or cloud infrastructure but want to run experimental models with their data. Key features include a user-friendly graphical interface for setting up and launching experiments, a robust API for integration into existing research pipelines, and an automated system for managing and scaling computational resources. By handling the technical heavy lifting, Tamarind empowers researchers to concentrate on their scientific questions and accelerate the pace of discovery.

Accelerating Discovery with EvoProtGrad on Tamarind Bio

Using this EvoProtGrad (gradient-based directed evolution) framework on a platform like Tamarind would accelerate protein engineering campaigns by providing an efficient, flexible, and powerful optimization tool.

  • Rapid Sequence Optimization: The computational efficiency of the gradient-based MCMC allows researchers to quickly propose and screen thousands of high-potential mutations, drastically shortening the optimization cycle for existing lead proteins.

  • Integrated Multi-Objective Design: Researchers can define complex design goals—such as maximizing enzyme activity while ensuring high thermostability—and the platform will manage the concurrent optimization using the framework's multi-objective capabilities.

  • Leverage Existing Models: Because the method is Plug & Play, researchers can use any accurate, off-the-shelf prediction model as the fitness oracle without worrying about complex model fine-tuning or development.

How to Use EvoProtGrad on Tamarind Bio

To leverage the power of this method, a researcher could follow this streamlined workflow on Tamarind:

  1. Access the Platform: Begin by logging in to the tamarind.bio website.

  2. Select EvoProtGrad: From the list of available computational models, choose the EvoProtGrad tool.

  3. Input a Seed Sequence: Provide the amino acid sequence of the wild-type or parental protein to begin the evolutionary search.

  4. Define Objectives: Select one or more computational metrics (e.g., AlphaFold-Multimer confidence score, solubility score, or predicted binding affinity) to serve as the optimization objectives.

  5. Run Gradient-Based MCMC: The platform runs the core gradient-based discrete MCMC algorithm, which iteratively proposes and accepts beneficial mutations in a single sequence optimization process.

  6. Acquire Optimized Sequences: The output provides a set of highly optimized protein variants, ranked by their predicted composite fitness score, ready for experimental validation.

Source