Arian Amani

Machine Learning Scientist

Biography

I am a Machine Learning Scientist at AI VIVO and a Data Scientist at the Wellcome Sanger Institute. My work is at the intersection of computational biology and drug discovery, where I develop deep generative and foundation models for molecules and cells. I specialize in molecule generation and single-cell perturbation modeling using advanced techniques like VAEs, Diffusions, Transformers, and Flow Matchings. I’m passionate about building AI methods that accelerate target discovery and therapeutic design.

Interests

Deep Generative Models
Drug Discovery
Single-Cell Genomics
Representation Learning

Education

Applied Computer Science & Artificial Intelligence, 2023
Sapienza University of Rome
BSc in Computer Science, 2020
Amirkabir University of Technology

Work Experience

Machine Learning Scientist

AI VIVO

December 2024 – Present Cambridge, United Kingdom

Develop deep learning and generative models for drug discovery using transformer and flow matching architectures
Deploy and scale ML pipelines on GCP using PyTorch Lightning and Docker
Design multi-modal ML pipelines integrating molecular structure and biological assay data
Maintain scalable pipelines using PyTorch, Lightning, RDKit, and HuggingFace
Experience with computational chemistry tools like BioSolveIt, AutoDock Vina, and Boltz-2

Data Scientist

Wellcome Sanger Institute

November 2022 – Present Hinxton, United Kingdom

Responsibilities include:

Research assistant at Dr. Mo Lotfollahi’s lab
Co–first author of CellDISECT, a deep generative model for disentangled cellular representations and in silico perturbation analysis, developed to study perturbation effects across single-cell populations.
Collaborated on several ML projects in Single-Cell Genomics and Drug Discovery mainly using PyTorch
Developed task-specific fine-tuning pipelines on generative and transformer models
Contributed to ongoing projects like CPA: Compositional Perturbation Autoencoder (GitHub)

Computer Vision Engineer

Virasad

January 2022 – May 2022 Tehran, Iran

Responsibilities include:

Delivered >95% accuracy solutions for tasks with limited data (15 images per class)
Spearheaded development on 5 diverse projects meeting client requirements
Led individual projects, enhancing development pipelines

Featured Publications

Stathis Megas, Arian Amani, Antony Rose, Olli Dufva, Kian Shamsaie, Hesam Asadollahzadeh, Krzysztof Polanski, Muzlifah Haniffa, Sarah Amalia Teichmann, Mohammad Lotfollahi

June, 2025 bioRxiv

Integrating multi-covariate disentanglement with counterfactual analysis on synthetic data enables cell type discovery and counterfactual predictions

CellDISECT is a powerful causal generative model that enhances single-cell analysis by disentangling variations, making counterfactual predictions, and achieving flexible fairness.

Projects

CellDISECT

CellDISECT (Cell DISentangled Experts for Covariate counTerfactuals) is a powerful causal generative model that enhances single-cell analysis by disentangling variations, making counterfactual predictions, and achieving flexible fairness.

CPA (Compositional Perturbation Autoencoder)

CPA is a deep generative framework to learn effects of perturbations at the single-cell level. It performs OOD predictions of unseen combinations of drugs, learns interpretable embeddings, estimates dose-response curves, and provides uncertainty estimates.

Recent Publications

Quickly discover relevant content by filtering publications.

Stathis Megas, Arian Amani, Antony Rose, Olli Dufva, Kian Shamsaie, Hesam Asadollahzadeh, Krzysztof Polanski, Muzlifah Haniffa, Sarah Amalia Teichmann, Mohammad Lotfollahi (2025). Integrating multi-covariate disentanglement with counterfactual analysis on synthetic data enables cell type discovery and counterfactual predictions. bioRxiv.

PDF Cite Code Project DOI bioRxiv

Recent & Upcoming Talks

ML in Single-Cell Genomics Workshop

Taught a hands-on workshop on deep learning methods in single-cell genomics

Jun 1, 2024 1:00 PM — 3:00 PM RSGIran

Introduction to Adversarial Machine Learning

Can you hack deep learning models? Imagine your self driving car sees a Stop sign and thinks, yea that is definitely a 120km/h speed limit sign.

Jun 1, 2022 1:00 PM — 3:00 PM Computer Science Department, AUT

Introduction to Adversarial Machine Learning

Teaching Experience

Teaching Assistant

Sharif University of Technology

September 2022 – August 2023 Tehran, Iran

Machine Learning for Bioinformatics (Graduate Course) | Spring 2023
- Prepared teaching material on CNNs & AutoEncoders, designed assignments, and coordinated class contests.
Introduction to Machine Learning | Fall 2022
- Designed and graded assignments for a class of 150 students, conducted a workshop on Variational AutoEncoders.

Teaching Assistant

Amirkabir University of Technology

September 2021 – March 2022 Tehran, Iran

Introduction to Image Processing and Neural Networks | Fall 2022
- Conducted workshops and lectures on OpenCV and Deep Learning for a class of 80 students.
Advanced Programming with C++ | Spring 2022
- Designed assignments and projects for a class of 90 students, evaluated student submissions.

Accomplishments

Deep Learning Specialization

Coursera Aug 2022

Neural Networks and Deep Learning
Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
Structuring Machine Learning Projects
Convolutional Neural Networks
Sequence Models

See certificate

Arian Amani

Machine Learning Scientist

AI VIVO

Wellcome Sanger Institute

Biography

Work Experience

Featured Publications

Recent Posts

Projects

Recent Publications

Recent & Upcoming Talks

Teaching Experience

Accomplishments

Contact

Arian Amani

Machine Learning Scientist

AI VIVO

Wellcome Sanger Institute

Biography

Work Experience

Featured Publications

Recent Posts

Projects

Recent Publications

Recent & Upcoming Talks

Teaching Experience

Accomplish­ments

Contact

Accomplishments