Wanted - dataset(s) to test bacterial genome analysis / annotation methods. Ideally has many genomes from a collection of [interesting] bacteria with associated experimental phenotypes / metadata.

On LinkedIn:
- Jonathan Jacobs:
- "All the reference genomes in the ATCC Genome Portal are freely available for non-commercial research purposes. They are also fully authenticated and traceable to physical production lots in our biorepository, and produced under ISO quality management. I’m biased, but I think we’re producing the [highest] quality microbial genomes I’ve ever seen on a regular basis - so you might want to look there. We’re producing about 100-150 new genomes every month - and many of them (about 1/3rd) are for organisms with no preexisting genome. We have about 3,300 microbial genomes now (bacteria, viruses, fungi, protists), and all bacteria and fungi are sequenced on both Illumina and Nanopore. Drop me an email or DM if you want to learn more or collaborate or something. Here’s a link: https://genomes.atcc.org (and here’s a link to a comparative genomics paper we published in mSphere earlier https://journals.asm.org/doi/full/10.1128/msphere.00077-22)"
- And then: "I should also add that we have tons of metadata and I’ve hired a full-time data curator to help bring metadata we have in our historical records warehouse into our digital records (i.e. phenotypic data from routine QA/QC testing going back to the 1920s…)"
- Natalie Ma wrote:
- Joint Genome Institute may have this (if you're fine with an environmental microbes focus). Adam Deutschbauer has done several Tnseq libraries for the bugs and characterized their phenotypes.
- Eoin Brodie pointed to https://fit.genomics.lbl.gov/cgi-bin/myFrontPage.cgi
- Amy Lee wrote: https://twitter.com/minisciencegirl/status/1666186747733061640
- Would something like this help? http://m.genome.cshlp.org/content/early/2017/03/15/gr.213363.116 All the genomic data have been deposited, and the phenotype data are in supplementary tables (not in the PDF version).
- Grace Morales wrote https://twitter.com/Ungraceful_Bio/status/1666180695985897474
- Have you looked at using PATRIC / BV-BRC? They've got a ton of different bacterial genomes available and you can search by the data you'd like (AMR phenotypes, experiments done). I've pulled data for some AMR prediction work. Link: http://bv-brc.org
- Mike wrote https://twitter.com/mikethemadbiol/status/1666190833064849412
- This will get you a list of every Pathogen Detection isolate at NCBI with a genome and matching AST data https://ncbi.nlm.nih.gov/pathogens/isolates/#property:(%22has%20AST%20phenotypes%22)
- Also from Grace Morales https://twitter.com/Ungraceful_Bio/status/1666480238417829891
- Another one I thought of - PubMLST. (http://pubmlst.org)
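
The NCBI Pathogen Detection link from Mike's reply carries its filter in the URL fragment: `property:("has AST phenotypes")`, with the quotes and spaces percent-encoded. A minimal sketch of how that link is put together, using only the Python standard library. The helper name `isolates_url` is made up for illustration, and only the "has AST phenotypes" property comes from the link above; whether other property names work in the browser is not checked here.

```python
from urllib.parse import quote, unquote

# Base page of the isolates browser, taken from the link in the notes above.
BASE = "https://ncbi.nlm.nih.gov/pathogens/isolates/"


def isolates_url(property_name: str) -> str:
    """Build an isolates-browser link filtered on one property (hypothetical helper)."""
    # Percent-encode the double-quoted property name; the parentheses and
    # the "property:" prefix stay literal, as in the original link.
    quoted = quote('"' + property_name + '"')
    return BASE + "#property:(" + quoted + ")"


url = isolates_url("has AST phenotypes")
print(url)
# Decoding the fragment shows the human-readable filter.
print(unquote(url.split("#", 1)[1]))
```

Decoding the fragment is also a quick way to read what a shared link is filtering on before opening it.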