Short amyloids β antimicrobial peptides? π§¬βπ¦
amyloids, antimicrobial peptides, AMP, machine learning, AmpGram, negative dataset, bioinformatics
π Project highlights
- 𧬠Tests short amyloids predicted as AMPs
- π€ Uses AmpGram + 15 ML models
- π§ͺ Validates experimentally on 10 bacterial strains
- β Finds no antimicrobial activity
- π Proposes high-quality negative dataset for ML
π New paper!
π testing if amyloids can act like antimicrobial peptides
π Testing Antimicrobial Properties of Selected Short Amyloids
π§ Audio summary
Machine learning said βthese might be AMPsββ¦
experiments said βnopeβ π
π Hereβs a quick breakdown π§:
π¬ What is this about?
Amyloids and antimicrobial peptides (AMPs) look surprisingly similar:
- both can disrupt membranes
- both form aggregates (Ξ²-sheet structures)
- both interact with the immune system
π So the question: Can short amyloids act as antimicrobial peptides?
βοΈ The core problem
AMP discovery relies heavily on machine learning
BUT:
π models lack true negative examples
- very few experimentally confirmed non-AMPs
- datasets often contain false negatives
π leads to over-optimistic predictions
π§ What they did
π§© Step 1: ML-based selection
- screened 509 amyloids (WALTZ-DB)
- used AmpGram (n-grams + random forest)
- selected top 10 candidates
π only ~6% predicted as AMPs
βοΈ Step 2: model comparison
- tested 15 additional AMP predictors
π result:
- highly inconsistent predictions
- only 1 peptide agreed across all models
π§ͺ Step 3: experimental validation
They tested:
- π¦ 10 bacterial strains (Gram+ & Gramβ)
- π§« peptide concentrations up to 128 Β΅g/mL
- 𧬠human cell toxicity
π Key results
β No antimicrobial activity
- all peptides remained green (bacteria survived)
- MIC > 128 Β΅g/mL β no effective killing
π even top ML predictions failed
𧬠No cytotoxicity either
- no harmful effects on human cells
- IC50 values relatively high
π peptides are biologically inactive (safe but useless as AMPs)
π§© Some amyloids still aggregate
- 4/10 formed fibrils
- aggregation β antimicrobial activity
π structural similarity β functional similarity
β οΈ ML models struggle here
- short amyloids = edge cases
- predictions vary wildly across tools
π shows limits of current AMP predictors
π‘ Key insight
π This is a negative result and thatβs the point
These peptides:
- look like AMPs
- are predicted like AMPs
- behave like AMPs structurally
BUT:
π they are NOT AMPs
π Why this matters
π§ Better ML models
These sequences are:
π perfect βhard negativesβ
- highly similar to AMPs
- experimentally validated as non-AMPs
π ideal for training robust models
β οΈ Dataset problem
Only ~24 confirmed non-AMPs exist (!!!)
π massive bottleneck in AMP prediction
π£ Cultural shift in science
The paper strongly argues:
π publish negative results
- prevents bias
- improves ML datasets
- accelerates discovery
π BioGenies perspective
This paper is π₯ for one reason:
π it exposes a hidden failure mode in bio-ML
- models β biology
- predictions β function
- similarity β activity
π And more importantly:
negative data = high-value data
This is exactly what:
- π§ interpretable ML
- 𧬠robust datasets
- π¬ real-world validation
should look like
