Sign in to view Alex’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Greater Oxford Area
Sign in to view Alex’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
237 followers
224 connections
Sign in to view Alex’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Alex
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Alex
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Alex’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Websites
- Personal Website
-
https://alexlubbock.com
- Portfolio
-
https://github.com/alubbock
About
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
Activity
237 followers
-
Alex Lubbock reposted thisAlex Lubbock reposted this📢 We're hiring a Research Software Engineer to join our team at the Franklin! This is a great opportunity to work at the intersection of scientific research, AI, and software development, helping to solve complex challenges in the life sciences. In this role you will: - Collaborate with research teams across scientific disciplines - Develop scientific software and apply AI and data science - Build and improve compute and data infrastructure (HPC, cloud, data management) We're a collaborative team, offering a dynamic environment where your work directly contributes to advancing life sciences technology. Interested? We'd love to hear from you: https://lnkd.in/enkYmvgR #hiring #lifesciences #research #technology
-
Alex Lubbock shared thisIf you’re working with data analysis scripts, simulations, or computational experiments (especially with HPC or cloud), Microbench makes it simple to: • capture important metadata automatically for easy reproducibility (git commit/branch, file hashes, library versions, hardware info...) • compare results across runs and machines • time functions with minimal overhead Microbench v2 was just released - it adds a new CLI, new metadata capture plugins, webhook outputs and more useful stuff. It's free and open source. Learn more and get started: https://lnkd.in/e_yfdnxk #python #benchmarking #reproducibility #hpc
-
Alex Lubbock shared thisWe could improve the #reproducibility of published academic software by establishing minimum standards for testing and documentation https://lnkd.in/ehMEyz4
-
Alex Lubbock shared thisGreat list of global challenges from Bill Gates. I'd also include #cancer, which will only increase in impact as global life expectancy increases, and as many cancers transition from fatal to chronic. Technological challenges exist across treatment access, prevention, development, and personalisation.
-
Alex Lubbock shared thisCancer drug discovery requires software that can calculate dynamic drug response from high throughput screens and find insights in the resulting data. Thunor is a free web application and software toolkit to simplify this process. Here's a quick demo video and blog post summary #drugdiscovery #cancer #highthroughputscreening
-
Alex Lubbock liked thisAre you curious about how biological systems are modeled and analyzed to extract knowledge? Bring your curiosity to FYIM 2026!Alex Lubbock liked thisExcited to announce the 2026 Finding Your Inner Modeler (FYIM) Workshop, sponsored by the National Science Foundation (NSF) - please help us spread the word! 📅Dates: Aug 6–7 📍Location: Irvine, CA 🔗Link to Website: https://lnkd.in/gzYt9S9d This year's theme: Modeling Biological States Across Scales 📌 Registration opens early May UC Irvine
-
Alex Lubbock liked thisAlex Lubbock liked this📢 We're hiring a Research Software Engineer to join our team at the Franklin! This is a great opportunity to work at the intersection of scientific research, AI, and software development, helping to solve complex challenges in the life sciences. In this role you will: - Collaborate with research teams across scientific disciplines - Develop scientific software and apply AI and data science - Build and improve compute and data infrastructure (HPC, cloud, data management) We're a collaborative team, offering a dynamic environment where your work directly contributes to advancing life sciences technology. Interested? We'd love to hear from you: https://lnkd.in/enkYmvgR #hiring #lifesciences #research #technology
-
Alex Lubbock liked thisAlex Lubbock liked thisHow do you build an AI‑ready research data platform that’s robust, resilient, and FAIR? The Franklin’s Advanced Research Computing (ARC) team has developed a data storage and analysis platform that automates the transfer, cataloguing, and curation of research data for speed and consistency. Ensuring it is Findable, Accessible, Interoperable, and Reusable. The software is open source, and the team is sharing its data management expertise to help embed best practice across the research computing community. Read the full case study here - https://lnkd.in/eM-vCm5K
-
Alex Lubbock liked thisAlex Lubbock liked thisCongratulations to Professor Grant Stewart on his appointment as a National Institute for Health and Care Research (NIHR) Senior Investigator. The role recognises the top 200 biomedical researchers in the UK, with just 43 newly appointed this year. This follows last week's publication of the first national NICE guideline on kidney cancer, led by Grant who is a Selwyn Fellow. Grant said: "I'm honoured by this appointment. I'm grateful to my exceptional team and collaborators whose work helped make this possible. I'm proud to continue driving advances in surgical cancer research to improve outcomes for patients." Congratulations, Grant. #SurgicalOncology #KidneyCancer #NIHRSeniorInvestigator #CancerResearch
-
Alex Lubbock liked thisCAKE - Computational Abilities Knowledge Exchange
CAKE - Computational Abilities Knowledge Exchange
1moAlex Lubbock liked this📢 New blog from CAKE Fellow Lewis Sampson! STFC Kubernetes Workshop – K8s CAKE Fellowship 🧁 Bringing the K8s community together to share knowledge & build accessible training resources. Read more 👉 https://lnkd.in/enDDT8ZW #K8s #RSE #OpenSource #CAKE -
Alex Lubbock liked thisAlex Lubbock liked this🧬 Excited to share our latest publication in Bioinformatics (Oxford)! We present mirtronDB 2.0 - an enhanced database for mirtron research in RNA biology. What's new in version 2.0? ✅ Expanded mirtron data from 2017–2025 across multiple species ✅ Newly predicted mirtrons via a pipeline combining bioinformatics & machine learning - covering 6 mammalian species ✅ Interactive dashboard for intuitive data exploration Mirtrons are non-canonical miRNAs derived from intron splicing, and understanding them is key to unlocking new layers of gene regulation. mirtronDB consolidates all this knowledge in one place - freely available to the research community. 🔗 Database: https://lnkd.in/e9aQpbtd 📄 Paper: https://lnkd.in/ez3jWNDz Huge congratulations to all co-authors (Fabiana Góes, Bruno Nichio, Vitor Gregorio, Alisson Chiquitto, Matheus Fujimura Soares, Flavia Lopes, Mark Basham, Douglas Domingues) on this milestone! 🎉 #Bioinformatics #RNAbiology #miRNA #MachineLearning #OpenScience --------------------------------------------------------- 🧬 Feliz em compartilhar nossa mais nova publicação na revista Bioinformatics (Oxford)! Apresentamos o mirtronDB 2.0 — uma nova versão do banco de dados dedicado à pesquisa de mirtrons na biologia do RNA. O que há de novo na versão 2.0? ✅ Dados expandidos de mirtrons (2017–2025) em diversas espécies ✅ Novos mirtrons preditos por um pipeline que combina bioinformática e aprendizado de máquina — cobrindo 6 espécies de mamíferos ✅ Dashboard interativo para exploração intuitiva dos dados Os mirtrons são miRNAs não canônicos derivados do splicing de íntrons e representam uma camada fascinante da regulação gênica. O mirtronDB centraliza esse conhecimento de forma gratuita e acessível para toda a comunidade científica. 🔗 Banco de dados: https://lnkd.in/e9aQpbtd 📄 Artigo: https://lnkd.in/ez3jWNDz ================================ 🧬 ¡Emocionado de compartir nuestra nueva publicación en Bioinformatics (Oxford)! Presentamos mirtronDB 2.0 — una versión mejorada de nuestra base de datos dedicada a la investigación de mirtrones en biología del RNA. ¿Qué hay de nuevo en la versión 2.0? ✅ Datos ampliados de mirtrones (2017–2025) en diversas especies ✅ Nuevos mirtrones predichos mediante un pipeline que combina bioinformática avanzada y aprendizaje automático — con cobertura de 6 especies de mamíferos ✅ Panel interactivo para una exploración de datos más intuitiva Los mirtrones son miRNAs no canónicos derivados del splicing de intrones, y comprender su función es clave para descubrir nuevas capas de regulación génica. mirtronDB centraliza este conocimiento de forma gratuita para toda la comunidad investigadora. 🔗 Base de datos: https://lnkd.in/e9aQpbtd 📄 Artículo: https://lnkd.in/ez3jWNDzmirtronDB 2.0: enhanced database with novel mirtron discoveriesmirtronDB 2.0: enhanced database with novel mirtron discoveries
-
Alex Lubbock liked thisAlex Lubbock liked thisResearchers at the Franklin studying schistosomiasis- a neglected tropical disease affecting more than 200 million people globally- have now observed the structure of a key protein thanks to a unique blend of nanobody discovery and advanced data science. One enzyme, known as Schistosoma mansoni Cathepsin D1, SmCD1, has been identified as a potential therapeutic target which the Franklin has been studying in collaboration with the London School of Hygiene and Tropical Medicine and the Oswaldo Cruz Institute in Rio de Janeiro. The group of researchers identified a high affinity nanobody that bound to the parasite enzyme and not to closely related human enzymes. By fixing the enzyme in a specific three dimensional shape, the nanobody enabled the team to determine the first crystal structure of SmCD1. This project utilised new software which supported the entire nanobody generation pipeline. Known as AntigenApp, it was developed by the Franklin Nanobody and Advanced Research Computing teams working closely together and they hope AntigenApp will be useful to the wider scientific community. “AntigenApp helped us to automate previously time-consuming tasks freeing up time and speeding up we how do things. The development of AntigenApp was a real team effort, working with the ARC team was critical to the success of this project,” said Dr Lauren Eyssen, Staff scientist, Nanobodies Discovery Platform. Read the full story here - https://lnkd.in/eUbxQp2b Kelly P. | John Clarke | Ray Owens | Alex Lubbock | Laura Shemilt | Mark BashamCapturing Complexity: How AntigenApp Is Transforming Nanobody Discovery at the Franklin - RFICapturing Complexity: How AntigenApp Is Transforming Nanobody Discovery at the Franklin - RFI
View Alex’s full profile
-
See who you know in common
-
Get introduced
-
Contact Alex directly
Other similar profiles
-
Vincent Koppelmans
Vincent Koppelmans
University of Utah School of Medicine
677 followersSalt Lake City, UT -
Debajit Chakraborty
Debajit Chakraborty
Physics & Astronomy, University of Nebraska, Kearney
598 followersKearney, NE
Explore more posts
-
James Brewster
Cime Therapeutics • 1K followers
Automation has been used in chemistry for decades. The problem for many though is applying these tools is not intuitive. Many papers describe these expensive or "home built" setups that also require the ability to code; it's not practical. Opentrons Labworks Inc. along with AI-assisted coding tools from Anthropic and OpenAI are changing that. It is now easy and affordable to start. At Cime, we are working to push past the simple one step transformations to explore new chemical space and sharing some of what we learn along the way. Patrick Doerner Barbour perfectly highlights how we do more together by open sourcing tools that are enabling us to automate synthesis (with more to come!).
18
-
Jonathan Broadbent
Sanofi • 1K followers
Hey LinkedIn, 👋 I'm starting a new series of post which will give readers a bit of an insight into the life of a data scientist. I'll be talking about some of my favourite tools/programs/softwares that I use in my day-to-day work. 🛠️ Data Science Tools #1: Vim! Vim is a low-level, retro text editor that you open as a command in the terminal. It's automatically installed in most linux/unix operating systems. To try it out just type `vim` and it will pop up. 💻 If you've never opened it before you might be confused or frustrated. (i.e. "ugh, how do I close vim!? 🤬 ") Don't worry I was there too. The idea behind programming in Vim is that your fingers never have to leave your keyboard. You can navigate across the whole editor using keyboard shortcuts. If you want to actually type something you click `i` to enter insert mode and then escape to get back to command mode (another top tip: I remapped CAPS-LOCK to escape, because who really needs CAPS-LOCK). Some useful commands I use often to navigate the editor 🚢 - `gg` go to the top of the file - `G` go to the end of the file - `ctrl+u` page up - `ctrl+d` page down - `w` move forward one word - `b` move backword one word - `/` search for something The other great thing about Vim is that it is customizable! You can install so many plugins and colorschemes onto it, make your own commands, you can really make it your own. Opening up my Vim terminal is often where I feel most at home 🏡 . Here are some of my favorite plugins - NerdTREE: Gives you a small filebrowser in pane next to your editor - Copilot: LLM generated code autocompletions (subscription required) - Colorscheme: Catppuccin-macchiato/tokyo-night - vim-airline: Gives a nice looking bar on the bottom showing your filename and folder and environment you are in - Comment: enable block commenting with `,gc` There's loads more that you can do with it. I love Vim and often people ask me why not just use VSCode or Cursor. For me it boils down to a couple reasons: 1. Speed 🏎️ : The slowest unit operation at your desk is moving your hand from your keyboard to your mouse. When you eliminate that operation your work is much faster. Albeit this does come with a learning curve first of learning the commands but that is a small cost for many gains in the future. 2. Focus 😌 : Maybe the speed argument was more relevant 5 years ago... now I admit with AI coding agents it's hard to beat that. However, I find that I never focus better on my programming then when I have a just the terminal open in dark mode and nothing else. There is no distractions and it feels great. 3. Fun 🥳 : Once you learn the commands Vim is a lot of fun. It's like this subtle game I am playing in the background whilst working. I love it. It's hacker, its retro and it's one of the small things that I lowkey love about my job. If you want to try out vim (I actually use NeoVim) you can install my setup here: https://lnkd.in/evGQBiYz
51
3 Comments -
University of Cambridge MPhil Data Intensive Science
1K followers
We’re delighted to welcome Dr Shruti Mishra for this week’s DIS seminar: 📅 Thursday 5th March 2026 ⏰ 2–3pm 📍 Room A, West Hub Title: Reinforcement Learning in Locomotion and Navigation Environments Dr Mishra will introduce the reinforcement learning framework and Markov Decision Process formulation, before presenting applications to locomotion and navigation tasks across natural and synthetic environments. The talk will explore practical design choices, such as reward shaping, observation spaces, and algorithmic considerations, and conclude with an open-source environment for experimentation. About the speaker Dr Shruti Mishra is an Encode: AI for Science Fellow in the Department of Applied Mathematics and Theoretical Physics, where she develops reinforcement learning methods for continuous control problems such as fluid flows. Her research spans reinforcement learning, mathematical modelling, and continuum mechanics. Previously, she was a Research Scientist at Sony AI and DeepMind. She holds a PhD in Applied Mathematics from Harvard University and an MEng in Mechanical Engineering from Imperial College London. ☕ Coffee and cakes to follow — all welcome! #ReinforcementLearning #ArtificialIntelligence #MachineLearning #ContinuousControl #Locomotion #Navigation #AppliedMathematics #AIforScience #ResearchSeminar #AcademicEvents
1
-
Ramy Shelbaya
Quantum Dice • 4K followers
"At the same time, compute is becoming essential to modern science from genomics, to climate modelling to materials discovery, and the science of AI itself." Last month, the UK government released its UK Compute Roadmap (link below). It's great to see that amongst all the hype around Gen AI, decision makers still have a broader vision and understanding of the importance of the development of computing technologies for the entire modern economy beyond just this, important but often overemphasized, use case. The UK has long been at the forefront of computing technologies but it will require a concrete vision and more, importantly, a decisive and patient implementation strategy to maintain, and grow, its leading position. Reading through the roadmap, the vision is clearly there but now comes the hard part. Excited to see what comes next! https://lnkd.in/dbdUW9Mb
27
1 Comment -
Ganesh Venkatesh
Waymo • 2K followers
I'm incredibly proud to announce a productive NeurIPS for our Post-training AI Research team. We had two papers on test-time scaling accepted into the Workshop on Efficient Reasoning, contributing to a fantastic ~6 acceptances for the wider Applied AI Research @ Cerebras. Both of our papers are now available on ArXiv: - The Conductor and the Engine: A Path Towards Co-Designed Reasoning. Link - https://lnkd.in/g5Dnnz6C. - Calibrated Reasoning: An Explanatory Verifier for Dynamic and Efficient Problem-Solving. Link - https://lnkd.in/gH2jTGeP. In the spirit of collaboration, we are also open-sourcing our updated CePO flow as part of our OptiLLM library on GitHub. This is the same methodology that achieved top scores on the Artificial Analysis Leaderboard. Link - https://lnkd.in/gd6anwFN Congratulations to the team on this incredible milestone: Anisha Garg, David Bick, Engin Tekin, Michael Wang, Pawel Filipczuk, Amaan Dhada, Yash More, Nishit N. and rest of the Post-training Team! Looking Ahead A key enabler for these results and our upcoming work in Coding Agents has been using Reinforcement Learning (RL) to provide LLMs with new, specialized expertise, which makes them highly amenable to test-time compute scaling. This brings me to our next exciting step... 🚀 Announcing Limited Early Access to our RL Service! 🚀 Is your team excited by the potential of powerful LLM models — closed source like GPT-5/Claude/Gemini or Open-source like Qwen3, GPT-OSS — but frustrated when they fail that "last mile" on your specific, critical tasks? We are opening up a limited early access program to help you solve this. Our RL service is designed to transform general models into world-class experts for your unique domain. If you're interested in building an AI system that is an expert at solving your tasks, reach out to me to see if you're a fit for the program. 📧 Email: ganesh.venkatesh@cerebras.net When you reach out, please include: - Subject: "RL Service Early Access" - Body: A brief description of your application and the "last-mile" challenges you're facing. Looking forward to post-training your problems away!
122
1 Comment -
R Consortium
11K followers
New on the R Consortium blog: “Contributing to base R with Coding Equity and Joy — Inside the R Contributors Project.” Ella Kaye, Senior Research Software Engineer, University of Warwick, shares how the R Contributors project is making it easier—and more welcoming—to contribute to base R: R Developer Days, monthly contributor office hours, and a C Study Group for R contributors. She also explains why using GitHub (issues, discussions, labels) can lower barriers vs. Bugzilla. Bonus: a fun case study on learning-with-joy through the “aperol” R package—and how community feedback turned a silly idea into real learning. Bonus-bonus: Ella covers the history of rainbowR, a community that connects, supports and promotes LGBTQ+ folk who code in R, and spreads awareness of LGBTQ+ issues through data-driven activism. Read it all here: https://lnkd.in/ggnbiVjH #rstats #opensource #RProject #Community #ResearchSoftware
48
-
Society for Technological Advancement (SoTA)
2K followers
Winners of SoTA’s Opening the Black Box interpretability hackathon, Nitish Mital, (PhD) and Simon Malzard, PhD explore how synthetic data can be made smarter, not just bigger. In their letter to SoTA, they introduce a SHAP-guided synthetic data refinement approach - using explainable AI to identify and correct the features that mislead models. By closing the loop between explainability, simulation, and data design, their work points toward a future where synthetic data actively teaches models to see the world more accurately. Read their letter in full (link in the comments) to learn about their project, how it relates to their PoC at the SoTA Hack last January, and what their future plans are.
23
1 Comment -
The Alan Turing Institute
113K followers
How is AI changing our lives? 🎧 The Turing's Adrian Weller joined Brian Cox, Neil Lawrence, Steph Wright and Jeanette Winterson to explore this fundamental question. Listen to the latest episode of The Francis Crick Institute’s A Question of Science podcast: https://lnkd.in/eR3yedD3 #AI #AIEthics #AIforGood #AIGovernance
163
1 Comment -
Miguel Bragança
InstaDeep • 786 followers
AbBFN2 is out! We introduce a new generative AI model which has a flexible, multi-objective approach to antibody engineering. 🚀 Try AbBFN2: https://lnkd.in/dN43HPT8 📄 Preprint: https://lnkd.in/dFhehcSw 💻 Access the code: https://lnkd.in/dNdB6sC2
40
5 Comments -
Jesus College Oxford
7K followers
We are pleased to share that Dr Sergii Strelchuk, Tutorial Fellow in Computer Science at Jesus College and Associate Professor at Oxford, has been selected to lead the final phase of Quantum Pangenomics within the prestigious Wellcome Leap Q4Bio Challenge. This international project brings together experts from quantum computing and the life sciences to explore how quantum approaches might transform the analysis of large and complex genomic datasets, with potential benefits for personalised medicine and disease detection. Supported by Wellcome Leap and involving partners including the Wellcome Sanger Institute, the University of Cambridge and the University of Melbourne, the initiative places Oxford at the centre of a rapidly developing field. 🔎👉 Read more: https://lnkd.in/eKV7Ez2m
21
-
UKRI CDT in Foundational AI
2K followers
We're delighted to release another "Conversations with PhD Researchers" interview - this time with PhD Student Emilio McAllister Fognini. In this episode, Emilio introduces us to the world of Neural Operators - a powerful and abstract class of machine learning techniques used to model differential equations. At the CDT, we’re proud to support research pushing the boundaries of computational science and creating pathways toward real‑world impact. Emilio’s work is a fantastic example of the innovation emerging from our centre, and we would like to pass on our thanks to Emilio for the fantastic explanation and to Reuben Adams for producing another great interview! UCL Computer Science UCL Centre for AI https://lnkd.in/gEnVnVbY
9
1 Comment -
Javier Quílez Oliete
M42 Health • 4K followers
Wrapping up London Calling 2025 - Oxford Nanopore Technologies’ annual meeting - and its satellite Informatics Day with a few personal highlights (in no particular order): 1. R10 chemistry is here to stay No updates are foreseen in the R10 chemistry, which is welcome news for those working toward clinical implementation of Oxford Nanopore - more on this below. 2. Dorado as a variant calling framework "Dorado is not just a basecaller, it's a framework. [...] It's time for ONT to take more ownership of variant calling.". ONT developing its own variant caller as part of the Dorado engine (it still will use Minimap2 for mapping). Preliminary results show promising performance beyond Clair3, ONT’s current recommendation. 3. Diverse and growing use in clinical care Biased as we might be as event attendees, the breadth of real-world clinical genomics applications is impressive. From early adopters like Danny Miller - who now predicts broader adoption within 3 years - to national programs and clinical labs transitioning from short-read to ONT, the shift seems both credible and imminent. 4. Adaptive sampling love A favorite among users for its flexibility: a cost-effective method to target specific regions while gaining bonus data from off-target regions (e.g. imputation, methylation). Hard to imagine its use declining unless WGS costs drop significantly. 5. Genome assemblies for humans Unexpectedly (to me), ONT is now highlighting its value for more than just microbes. In human genetics, ONT offers better resolution of complex regions (e.g. *SMN1/2*), accelerating pan-genome research and utility. 6. Proteomics: still early days While ONT’s expansion into protein profiling is often mentioned at London Calling, it still feels a few years out for routine users. That said, Jeff Nivala’s work in this space is highly promising and clearly setting the stage. 7. Data standards & sharing A maturing field. There was a strong multidisciplinary focus on how data is stored, represented, and shared among stakeholders - clinicians, patients, collaborators - which signals a maturing ecosystem striving for responsible, trusted data use. 8. Clinical reporting and literacy Important conversations around how to upskill clinicians in genomics and ensure they feel confident reporting findings to patients. Essential for real-world clinical integration. A few quotes that stayed with me: * Ewan Birney on data standards: “If we look back, we’ve come a long way - but we’re not at the top of the mountain.” * Rowan Gardner on adoption: “You’ll never change clinical practice if you try to hit on it.” * Georgina Dawson: “People can only understand what they can perceive.” #nanoporeConf
33
3 Comments -
Quantum Motion
15K followers
Quantum Motion brings together an amazing group of people to build amazing things. Listen to the team talk about delivering our first, full-stack quantum computer to the UK's National Quantum Computing Centre (NQCC). "This could only have been done here at Quantum Motion because we've gathered the expertise of all the necessary teams to deliver this full-stack system. We have quantum hardware engineers, intelligent automation professionals, we have theorists, we have experts in modelling, and without all these people, this endeavour wouldn't have been possible. It's not that we have all these teams, it's that we have the best people in each one of these teams." - M. Fernando González Zalba, Principal Quantum Engineer at Quantum Motion. Drive the future of quantum computing. Join the team: https://lnkd.in/eJWgAGj6 Team featured: April Carniol, Anna Stockklauser, David J. Ibberson, Katarina Brlec, Giovanni Oakes, David Wise, Tuula Ritakari
122
5 Comments -
Laura Crawford
240 followers
I'm looking forward to attending the CoSeC conference and sharing the metadata work we've been developing within the ARC team at the Rosalind Franklin Institute. FAIR pipelines aren't built overnight, but automating metadata capture is a big step towards more transparent, reproducible, and efficient data management across the research lifecycle. This talk is a snapshot of where we are now and where we're heading next
24
-
Jaber Jaber
Actimo Labs • 898 followers
Is there a bitter lesson in antibody modelling? In 2019, Rich Sutton published a now famous blog post titled "The Bitter Lesson". The core argument is that generic models that scale with compute tend to overtake specialized, domain-specific approaches. For LLMs, much of their emergent behaviour can be attributed to scale. This convinced researchers to spend less time worrying about minor architectural tweaks to account for novel edge cases and keep scaling their compute, dataset size, and parameter count. For protein models, consider the results from the Ginkgo Datapoints Developability Competition (held by Ginkgo Bioworks). 113 teams tried to predict antibody properties from sequence alone. The best models showed moderate correlation for clearer properties like hydrophobicity, but for complex ones like self-association and expression titer, predictions were falling short. Models that looked strong in validation struggled on the held-out set. The authors name the bottleneck directly: available datasets are too small and too heterogeneous. If you follow the bitter lesson to its conclusion: as the protein language models tested in this competition scale their data, their ability to generate zero-shot developable antibodies will continue to improve. A recent finding is starting to demonstrate this! A team at JURA Bio, Inc. recently used a specialized method - variational synthesis - to design ~10^17 antibody sequences for roughly $1,000. The method folds the generative model into the DNA synthesis process itself, so you define a target distribution over antibody sequences, optimize the physical synthesis parameters so the library coming off the machine approximates that distribution, and let the chemistry do the sampling. In their blog post, the authors train a transformer with cross attention and CLIP-style contrastive learning on their dataset to predict scFv-pHLA interactions, which is simpler than other models leveraging MSAs/structural data. The model generalised to unseen antibodies and unseen targets, with some targets reaching over 100,000x enrichment for specific binders. This also reminded me of Leash Bio's work with Hermes, described well in a blog post by Abhishaike Mahajan from late last year. Hermes is a lightweight 50M-parameter transformer trained exclusively on Leash's proprietary protein-ligand binding. On held-out chemistry with zero overlap to training, it outperformed Boltz-2, a far more complex structural model. The takeaway was that dense, high-quality, internally consistent data can outweigh architectural sophistication. So, is there a bitter lesson in antibody modelling? These evidences suggest yes. When you get the data right, simpler models can keep up with or outperform more complex ones. We're thinking carefully about the data pipelines at Actimo Labs and how we can leverage simpler models for better, more developable antibodies. #antibodyDiscovery #drugDiscovery #machineLearning #computationalBiology
35
6 Comments -
Corin Wagen
Rowan • 5K followers
While there's been a lot of interest in using NNPs to study protein–ligand interactions with DFT-level accuracy, Ishaan Ganti found that NNPs struggle for these complexes and physics-based xTB methods do much better. "Quantity has a quality of its own," as the saying goes...
66
5 Comments -
Deniz Kavi
Tamarind Bio • 13K followers
Grand Challenges in Computational Small Molecule Drug Discovery This work, a massive undertaking two years in the making, surveys scientific and technical problems where better prediction would materially improve drug discovery outcomes. Benchmarks of methods or models are certainly useful, but we've still not agreed which problem spaces AI can be applied to actually mean "better" drug discovery. Chemistry: synthesis planning, process chemistry, covalent inhibitor design, chemical stability/degradation Structure: crystal packing/polymorphism, protein structure, protein dynamics, protein–ligand pose prediction, cryptic pocket discovery Energy: binding affinity, selectivity, kinetics, allostery/agonism Pharmacology: pKa, solubility/aggregation/permeability, plasma protein binding and volume of distribution, clearance, oral bioavailability, metabolism, toxicity, dose prediction, PK/PD The authors propose that the AI-led transformation will come from solving specific, measurable problems as opposed to fully end-to-end black box solutions. Even in the world where the latter comes true, these challenges are highly valuable evaluations for the efficacy of future protocols. For each challenge the authors outline: • the underlying physical problem • why it matters • the current state of the field • inputs, outputs, and data types • metrics that would define meaningful progress Congratulations on the preprint Woody Sherman, Connor W. Coley, and co-authors. ———— Tamarind Bio is a collection of 250+ molecular design tools such as AlphaFold and most of current best solutions to the grand challenges discussed in the paper, accessible via web interface and API.
234
7 Comments
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content