Comments on The Tree of Life: Question - anyone having issues w/ delays/difficulty in the process of getting genomes / metagenomes into Genbank?
Blog author: Jonathan Eisen
Feed updated: 2024-03-28T00:36:36.460-07:00
4 comments

Anonymous, 2013-03-27T23:39:26.363-07:00

I can see the point(s) about ease of submission, but the major issue with lots of smaller DBs is sustainability: who will look after a database longer term?
There is also the point that being in a single format allows for direct comparisons to be made.
And finally, from an end user's point of view, wouldn't you rather be able to go to one (or just a few) places to find all the genomes of interest?
Now I've not tried to submit data to NCBI so I don't know your pain, but having worked in the ENA at EBI, I know it really isn't so hard to submit data there.
Regarding @caseybergman's comment about @GigaScience, we can provide an option for those submitting papers to the GigaScience journal, but we're still encouraging submission of raw data to the SRA.

Rob Edwards, 2013-03-27T09:37:35.913-07:00

You don't have to submit to GenBank ... the European Nucleotide Archive is much easier to work with than GenBank or the SRA. There are a few publications where people use SEED, RAST, microbes online, IMG, etc. to announce bacterial genomes.

...rant...
If GenBank is an archive of your annotations, why do they make you use PGAAP to annotate? If you don't need to use PGAAP, why don't they accept annotations directly from third-party tools (RAST, microbes online, etc.)?
.../rant...

The days of monolithic databases holding all known sequence data are nearing an end. The question is, how can scientists still get access to all the sequence data they are interested in?

The problem with github/figshare/etc. is generating a common dataset that holds all (or most) of the sequence data for new comparisons.

There is no technical reason we have to submit to GenBank; we should be able to use whatever database has the best access. Provided they are listed in a common aggregator of web services (e.g. http://www.biocatalogue.org/) and they provide an API for programmatic access, then everyone can access the data from anywhere. [In principle we could use RDF but that does not work in practice.]

Free the data, make smaller, open databases, but make sure they are linked and accessible to all.

Jonathan Eisen, 2013-03-26T15:36:01.349-07:00

Absolutely - but given that everyone is wasting months to years on just getting data into Genbank - we need to either agree that people won't do that or find another way to share.

Julie Dunning Hotopp, 2013-03-26T15:33:35.359-07:00

As much as it saddens me to say this (since a colleague refers to me as the dumpster diver of genomic data), I think that scientists need to have a frank conversation about the costs and benefits of saving every piece of genomic data and curating it.