Questions relating to information regarding, and use of resources from, the National Center for Biotechnology Information (NCBI). Please take care to ensure that questions are on-topic; see the help pages for more information.
Questions tagged [ncbi]
26 questions
6
votes
1 answer
determining horizontal gene transfer between two genomes?
I want to determine if there is horizontal gene transfer between two genomes, in particular I was to use the parametric methods described in wikipedia since I would most likely struggle to understand the phylogenetic methods fully.
The two genomes…
Ro Siv
- 1,279
- 2
- 16
- 34
5
votes
3 answers
Retrieve all predicted cds from NCBI
Please apologies if this has been answered somewhere else, but I couldn't find an answer to this problem.
I would like to retrieve all the predicted coding sequences on the NCBI ftp for a given species. Let's say my interest species today is…
tlorin
- 163
- 5
3
votes
2 answers
Difference between NCBI's /genomes and /1000genomes
Wondering what the difference is in the data hosted here:
ftp://ftp-trace.ncbi.nih.gov/1000genomes
ftp://ftp.ncbi.nih.gov/genomes/
Also (sidenote), would be interested to note what the difference between ftp-trace and just ftp is. But there is…
Lance
- 733
- 3
- 12
3
votes
1 answer
Why is GenBank growth slowing down?
https://www.ncbi.nlm.nih.gov/genbank/statistics/ shows the growth of the GenBank database is slowing since WGS (Whole Genome Shotgun) emerged. Is this happening because sequencing centers are submitting their data unannotated (WGS submissions are…
player87
- 246
- 2
- 6
3
votes
1 answer
How to get the correct RefSeq Protein transcript for a given RefSeq Nucleotide transcript?
How to get versioned Protein Accession Number for a Refseq Accession Number?
I have some versioned RefSeq Accession numbers and I would like to know their corresponding Protein Accession Numbers.
According to the RefSeqFAQ …
ChrisGuest
- 157
- 7
2
votes
1 answer
Taxonomy problems on algae: Is Cryptophyta a Phylum or a Class rank?
I am currently interested in algae taxonomy (I am not an expert in the field).
I retrieved the taxonomy of my sequences with their NCBI access number via the taxonkit software (https://bioinf.shenwei.me/taxonkit/usage/).
I noticed that I get…
Tof
- 121
- 1
2
votes
1 answer
What does "Origin" mean in NCBI GenBank?
I'm having trouble determining what exactly is meant by the term "Origin" in this case. It is just before a 70 bp sequence that the page claims is a plasmid, but does not specify it to be what I assume is incomplete, seeing as I don't think there is…
CDB
- 1,836
- 10
- 20
2
votes
2 answers
How to search NCBI in bulk for a list of accession numbers?
I have a large ( >100 ) list of accession numbers I want to look up and match to searches in NCBI (nucleotide); mainly for getting a tentative organism to match to the accession number.
ex:
KJ841938.1 would match to Setoptus…
Ro Siv
- 1,279
- 2
- 16
- 34
1
vote
0 answers
Creating a phylogenetic tree from my selected publicly-available sequences (WGS) in NCBI
I'm currently writing a paper on the comparison of virulence genes for a group of bacteria. I got my data from publicly-available whole genome sequences in NCBI. Now, I want to create a phylogenetic tree for these species but since I'm working from…
rimuru
- 21
- 2
1
vote
1 answer
How to download different kinds of data from NCBI eutils?
I have been researching NCBI eutils and wish to get some 'big data' from it. I know that I can construct queries to query one of (I think) 8 databases, like this:…
Joey Gough
- 143
- 5
1
vote
1 answer
From refseq ids to Go term Ids
I have a list of Refseq accession numbers such as :
YP_009448812
YP_009448725
YP_009448701
NP_659591
around 10 000 acc_numbers...
and I'm looking for a tools in R or Python in order to get the corresponding Go term ids.
I tried the packages…
Grendel
- 115
- 4
1
vote
1 answer
Open Mass Spec Database
Wondering if there is an open (freely available) database of Mass Spec Data, particularly for the Substances/Compounds listed in NCBI.
Specifically, I am not sure if NCBI includes the mass spectrometry data / datafiles in their FTP server related to…
Lance
- 733
- 3
- 12
1
vote
2 answers
Why was Achromobacter xerosis removed from the NCBI taxonomy?
The Global Catalogue of Microorganisms lists a bacterium called Achromobacter xerosis which is mentioned in several papers and patents. It once existed in the NCBI taxonomy database, with ID 216898. However, it is no longer there - going to…
Mark Amery
- 122
- 7
1
vote
2 answers
moltype controlled vocabulary?
In submitting sequences using tbl2asn to NCBI/GenBank, the documentation states that there is a controlled vocabulary for the key "moltype", but no where on the Internet can I find a full list of that vocabulary. "mRNA" and "genomic" are given as…
David Maddison
- 11
- 1
1
vote
2 answers
Blast databases
I am helping a colleague setup a local blast server. My background is computer science so I apologize if I use incorrect terminology.
Using the NCBI blastn webpage, one of the databases listed is "NCBI Genomes (chromosome)." I'm unable to find this…
cbake
- 11
- 1