----------------------------------------------------------------------------
This document lists the controlled vocabulary used to define imported data
(cv_datasources) in the neXtProt database
Release: 07-Nov-2016
The definition of the CV is provided in the following format:
-----------  ----------------------------------  -----------------------------
Line code    Content                             Occurrence in an entry
-----------  ----------------------------------  -----------------------------
ID           Unique identifier                   Once; starts a cv data release entry
AC           Unique accession (CVDS-xxxx)        Once
DE           Definition                          Once or more
DR           Data repository / ftp site where the  Once
DO           Document (MDATA)                    Once
//           Terminator                          Once; ends an entry
next id: CVDS-0112
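
The line codes above can be read with a small parser. The following is a
minimal sketch in Python, not part of the neXtProt tooling: the names
parse_cv_entries, KNOWN_CODES and SAMPLE are hypothetical, and the sketch
assumes each line starts with a line code followed by whitespace and a value.
It treats a DR or DO line with no value as absent, since several entries in
this file carry an empty DR line or omit DR/DO altogether.

```python
from typing import Dict, Iterator, List, Optional

# Line codes listed in the format table (the "//" terminator is handled separately).
KNOWN_CODES = {"ID", "AC", "DE", "DR", "DO"}


def parse_cv_entries(text: str) -> Iterator[Dict[str, List[str]]]:
    """Yield one dict per entry, mapping each line code to its list of values."""
    entry: Optional[Dict[str, List[str]]] = None
    for raw in text.splitlines():
        line = raw.strip()
        if line == "//":
            # Terminator: emit the entry collected so far, if any.
            if entry is not None:
                yield entry
            entry = None
            continue
        code, _, value = line.partition(" ")
        value = value.strip()
        if code == "ID":
            # An ID line always starts a fresh entry; this also discards
            # header text that happens to begin with a line code.
            entry = {"ID": [value]}
        elif entry is not None and code in KNOWN_CODES and value:
            # Empty values (for example a bare "DR" line) are skipped.
            entry.setdefault(code, []).append(value)


# Two entries copied from this file, used only as sample input.
SAMPLE = """\
ID Bgee
AC CVDS-0008
DE Information imported from dataBase for Gene Expression Evolution
DO MDATA_0038
DR
//
ID Ensembl
AC CVDS-0003
DE Information imported from Ensembl
DR http://www.ensembl.org
//
"""

entries = list(parse_cv_entries(SAMPLE))
for e in entries:
    # DR is optional in practice, so fall back to an empty list.
    print(e["ID"][0], e["AC"][0], e.get("DR", []))
```

Values are kept in lists because DE may occur more than once per entry; a
caller that expects a single AC or DR can take the first element.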
__________________________________________________________________________
ID AgBase
AC CVDS-0040
DE AgBase is a curated resource for functional analysis of agricultural plant and animal gene products.
DR http://agbase.msstate.edu/
//
ID Alzheimers_University_of_Toronto
AC CVDS-0078
DE Alzheimer's Project at the University of Toronto
DR http://www.ims.utoronto.ca/Page4.aspx
//
ID Antibodypedia
AC CVDS-0077
DE A searchable database of antibodies against human proteins
DR http://www.antibodypedia.com
//
ID Bgee
AC CVDS-0008
DE Information imported from dataBase for Gene Expression Evolution
DO MDATA_0038
DR
//
ID BHF-UCL
AC CVDS-0041
DE British Heart Foundation - University College London
DR http://wiki.geneontology.org/index.php/BHF-UCL
//
ID CACAO
AC CVDS-0100
DE Community Assessment of Community Annotation with Ontologies
DR http://gowiki.tamu.edu/wiki/index.php/Category:CACAO
//
ID CCDS
AC CVDS-0016
DE Consensus CDS project
DR
//
ID ClinVar
AC CVDS-0081
DE Database of mutations and their clinical relevance
DR http://www.ncbi.nlm.nih.gov/clinvar/
//
ID Cosmic
AC CVDS-0051
DE Catalogue Of Somatic Mutations In Cancer
DR http://www.sanger.ac.uk/genetics/CGP/cosmic/
//
ID DFLAT
AC CVDS-0042
DE Developmental FunctionaL Annotation at Tufts
DR http://dflat.cs.tufts.edu/data.htm
//
ID dictyBase
AC CVDS-0043
DE Model organism database for Dictyostelium discoideum
DR http://dictybase.org/
//
ID dbSNP
AC CVDS-0052
DE Short genetic variations database
DR http://www.ncbi.nlm.nih.gov/SNP/
//
ID DrugBank
AC CVDS-0107
DE Drug and drug target database
DR http://www.drugbank.ca/
//
ID Dyp
AC CVDS-0022
DE Information extracted from the Dyp database
DO MDATA_0007
//
ID EMBL-EBI
AC CVDS-0006
DE European Molecular Biology Laboratory / European Bioinformatics Institute
DR
//
ID Ensembl
AC CVDS-0003
DE Information imported from Ensembl
DR http://www.ensembl.org
//
ID eVOC
AC CVDS-0014
DE Information generated by eVOC ontology
DR
//
ID FlyBase
AC CVDS-0059
DE A database for Drosophila genetics and molecular biology.
DR http://flybase.org/
//
ID GDB
AC CVDS-0044
DE Human Genome Database
DR http://www.gdb.org
//
ID GFP-cDNA@EMBL
AC CVDS-0018
DE GFP-cDNA Localisation Project (EMBL)
DO MDATA_0001
//
ID GO
AC CVDS-0013
DE Information generated by Gene Ontology
DR
//
ID GO_central
AC CVDS-0097
DE Reference Genome Annotation Project
DR http://geneontology.org/page/reference-genome-annotation-project
//
ID GOC
AC CVDS-0045
DE inferred annotations from GO OBO v1.2
DR http://www.geneontology.org
//
ID HGNC
AC CVDS-0046
DE HUGO Gene Nomenclature Committee
DR http://www.gene.ucl.ac.uk/nomenclature
//
ID HPRD
AC CVDS-0015
DE Human Protein Reference Database
DR
//
ID Human protein atlas
AC CVDS-0005
DE Expression data imported from Human protein atlas
DR http://www.proteinatlas.org
DO MDATA_0005
//
ID Human protein atlas subcellular localization
AC CVDS-0057
DE Subcellular localization data imported from Human protein atlas
DR http://www.proteinatlas.org
DO MDATA_0006
//
ID InterPro
AC CVDS-0011
DE Information generated by InterPro
DR
//
ID IntAct
AC CVDS-0038
DE Information generated by IntAct
DR
//
ID IPI
AC CVDS-0082
DE International Protein Index, from EBI; no longer maintained
DR
//
ID KEGG_PTW
AC CVDS-0024
DE Link to KEGG pathways
DR
//
ID LIFEdb
AC CVDS-0047
DE LIFEdb
DR http://www.lifedb.de
//
ID MDATA_0004_2011
AC CVDS-0023
DE Information extracted from the submission MDATA_0004_2011
DO MDATA_0004
//
ID MDATA_0023_2012
AC CVDS-0058
DE Proteomics information extracted from the submission MDATA_0023_2012
DO MDATA_0023
//
ID MDATA_0033_2013
AC CVDS-0065
DE Information extracted from the submission MDATA_0033_2013
DO MDATA_0033
//
ID MEROPS
AC CVDS-0095
DE Information resource for peptidases
DR http://merops.sanger.ac.uk/
//
ID MeSH
AC CVDS-0012
DE Information generated by Medical Subject Headings
//
ID MGI
AC CVDS-0025
DE Data from the Mouse orthologs database
DR
//
ID MTBbase
AC CVDS-0071
DE Collection and Refinement of Physiological Data on Mycobacterium tuberculosis
DR http://www.ark.in-berlin.de/Site/MTBbase.html
//
ID NCBI
AC CVDS-0017
DE National Center for Biotechnology Information
DR
//
ID NCI
AC CVDS-0079
DE National Cancer Institute
DR
//
ID NextProt
AC CVDS-0001
DE Information generated automatically or manually by neXtProt
DR
//
ID NextProt integration
AC CVDS-0007
DE Data integrated by neXtProt from various sources
DR
//
ID NTNU_SB
AC CVDS-0054
DE NTNU_SB ontology
DR
//
ID OBO
AC CVDS-0037
DE OBO supports community members who are developing and publishing ontologies in the biomedical domain.
DR http://obofoundry.org/
//
ID Orphanet
AC CVDS-0106
DE Orphanet; a database dedicated to information on rare diseases and orphan drugs
DR http://www.orpha.net/consor/cgi-bin/home.php?Lng=GB
//
ID ParkinsonsUK-UCL
AC CVDS-0076
DE Parkinson's Disease Gene Ontology Initiative
DR
//
ID PeptideAtlas
AC CVDS-0035
DE Human peptide sequences and identifiers from PeptideAtlas
DR
//
ID PeptideAtlas human adrenal gland
AC CVDS-0083
DE Information generated by PeptideAtlas, project human adrenal gland
DO MDATA_0052
//
ID PeptideAtlas human brain
AC CVDS-0036
DE Information generated by PeptideAtlas, project human brain
DO MDATA_0017
//
ID PeptideAtlas human breast
AC CVDS-0084
DE Information generated by PeptideAtlas, project human breast
DO MDATA_0051
//
ID PeptideAtlas human digestive system
AC CVDS-0085
DE Information generated by PeptideAtlas, project human digestive system
DO MDATA_0050
//
ID PeptideAtlas human eye
AC CVDS-0086
DE Information generated by PeptideAtlas, project human eye
DO MDATA_0042
//
ID PeptideAtlas human female reproductive system
AC CVDS-0087
DE Information generated by PeptideAtlas, project human female reproductive system
DO MDATA_0048
//
ID PeptideAtlas human heart
AC CVDS-0088
DE Information generated by PeptideAtlas, project human heart
DO MDATA_0043
//
ID PeptideAtlas human kidney
AC CVDS-0069
DE Information generated by PeptideAtlas, project human kidney
DO MDATA_0035
//
ID PeptideAtlas human liver
AC CVDS-0070
DE Information generated by PeptideAtlas, project human liver
DO MDATA_0036
//
ID PeptideAtlas human lung
AC CVDS-0089
DE Information generated by PeptideAtlas, project human lung
DO MDATA_0044
//
ID PeptideAtlas human male reproductive system
AC CVDS-0090
DE Information generated by PeptideAtlas, project human male reproductive system
DO MDATA_0049
//
ID PeptideAtlas human others
AC CVDS-0056
DE Information generated by PeptideAtlas, project human others
DO MDATA_0022
//
ID PeptideAtlas human pancreas
AC CVDS-0091
DE Information generated by PeptideAtlas, project human pancreas
DO MDATA_0045
//
ID PeptideAtlas human plasma
AC CVDS-0019
DE Information generated by PeptideAtlas, project human plasma
DO MDATA_0008
//
ID PeptideAtlas human spleen
AC CVDS-0092
DE Information generated by PeptideAtlas, project human spleen
DO MDATA_0046
//
ID PeptideAtlas human urinary bladder
AC CVDS-0093
DE Information generated by PeptideAtlas, project human urinary bladder
DO MDATA_0047
//
ID PeptideAtlas human urine
AC CVDS-0055
DE Information generated by PeptideAtlas, project human urine
DO MDATA_0021
//
ID PeptideAtlas human phosphoproteome
AC CVDS-0104
DE Information generated by PeptideAtlas, project human phosphoproteome
//
ID PhosphoSite
AC CVDS-0096
DE Information resource for protein phosphorylation
DR http://www.phosphosite.org
//
ID PINC
AC CVDS-0048
DE Proteome Inc.
DR
//
ID PIR
AC CVDS-0009
DE Protein Information Resource
DR
//
ID PMID_18614565
AC CVDS-0034
DE Information extracted from the PubMed publication 18614565
DO MDATA_0013
//
ID PMID_19413330
AC CVDS-0075
DE Information extracted from the PubMed publication 19413330
DO MDATA_0025
//
ID PMID_19608861
AC CVDS-0032
DE Information extracted from the PubMed publication 19608861
DO MDATA_0011
//
ID PMID_20068231
AC CVDS-0026
DE Information extracted from the PubMed publication 20068231
DO MDATA_0016
//
ID PMID_20140087
AC CVDS-0029
DE Information extracted from the PubMed publication 20140087
DO MDATA_0009
//
ID PMID_20570859
AC CVDS-0020
DE Information extracted from the PubMed publication 20570859
DO MDATA_0002
//
ID PMID_20687582
AC CVDS-0031
DE Information extracted from the PubMed publication 20687582
DO MDATA_0010
//
ID PMID_20797634
AC CVDS-0030
DE Information extracted from the PubMed publication 20797634
DO MDATA_0014
//
ID PMID_20972266
AC CVDS-0027
DE Information extracted from the PubMed publication 20972266
DO MDATA_0012
//
ID PMID_21139048
AC CVDS-0028
DE Information extracted from the PubMed publication 21139048
DO MDATA_0015
//
ID PMID_21406692
AC CVDS-0021
DE Information extracted from the PubMed publication 21406692
DO MDATA_0003
//
ID PMID_21645671
AC CVDS-0033
DE Information extracted from the PubMed publication 21645671
DO MDATA_0018
//
ID PMID_21890473
AC CVDS-0074
DE Information extracted from the PubMed publication 21890473
DO MDATA_0020
//
ID PMID_22148984
AC CVDS-0060
DE Information extracted from the PubMed publication 22148984
DO MDATA_0026
//
ID PMID_22199227
AC CVDS-0061
DE Information extracted from the PubMed publication 22199227
DO MDATA_0027
//
ID PMID_22468782
AC CVDS-0062
DE Information extracted from the PubMed publication 22468782
DO MDATA_0028
//
ID PMID_22865923
AC CVDS-0063
DE Information extracted from the PubMed publication 22865923
DO MDATA_0024
//
ID PMID_23236377
AC CVDS-0073
DE Information extracted from the PubMed publication 23236377
DO MDATA_0034
//
ID PMID_23312004
AC CVDS-0064
DE Information extracted from the PubMed publication 23312004
DO MDATA_0029
//
ID PMID_23584533
AC CVDS-0066
DE Information extracted from the PubMed publication 23584533
DO MDATA_0030
//
ID PMID_23153008
AC CVDS-0067
DE Information extracted from the PubMed publication 23153008
DO MDATA_0031
//
ID PMID_23266961
AC CVDS-0068
DE Information extracted from the PubMed publication 23266961
DO MDATA_0032
//
ID PMID_24129315
AC CVDS-0080
DE Information extracted from the PubMed publication 24129315
DO MDATA_0039
//
ID PMID_25218447
AC CVDS-0099
DE Information extracted from the PubMed publication 25218447
DO MDATA_0053
//
ID PMID_23955771
AC CVDS-0102
DE Information extracted from the PubMed publication 23955771
DO MDATA_0054
//
ID PMID_25038526
AC CVDS-0103
DE Information extracted from the PubMed publication 25038526
DO MDATA_0055
//
ID Prosite
AC CVDS-0010
DE Information generated by Prosite
DR
//
ID PubMed
AC CVDS-0004
DE Information imported from PubMed
DR
//
ID Reactome
AC CVDS-0105
DE Reactome - a knowledgebase of biological pathways and processes
DR http://www.reactome.org/
//
ID RefGenome
AC CVDS-0049
DE GO Consortium Reference Genomes project
DR http://www.geneontology.org/GO.refgenome.shtml
//
ID Roslin_Institute
AC CVDS-0050
DE Roslin Institute
DR http://rgd.mcw.edu
//
ID SGD
AC CVDS-0098
DE Saccharomyces Genome Database
DR http://www.yeastgenome.org/
//
ID SRMAtlas
AC CVDS-0094
DE Human peptide sequences and identifiers from SRMAtlas
DR
//
ID String
AC CVDS-0039
DE Information generated by String
DR
//
ID SYSCILIA_CCNET
AC CVDS-0101
DE A systems biology approach to dissect cilia function. Imported from GOA.
DR http://syscilia.org/
//
ID Uniprot
AC CVDS-0002
DE Information imported from Uniprot
DR http://www.uniprot.org
//
ID UniPathway
AC CVDS-0053
DE A resource for the exploration of metabolic pathways
DR http://www.grenoble.prabi.fr/obiwarehouse/unipathway
//
ID WormBase
AC CVDS-0072
DE A comprehensive resource for genetics, genomics and biology of C. elegans and related nematodes
DR http://www.wormbase.org
//
ID PRO
AC CVDS-0108
DE Protein Ontology, a unique database resource for species-specific protein complexes
DR https://omictools.com/protein-ontology-tool
//
ID PhosphoSitePlus
AC CVDS-0109
DE A protein modification resource
DR http://www.phosphosite.org/homeAction.action
//
ID iPTMnet
AC CVDS-0110
DE A bioinformatics resource for integrated understanding of protein post-translational modifications (PTMs) in systems biology context.
DR http://research.bioinformatics.udel.edu/iptmnet/
//
ID PubTator
AC CVDS-0111
DE A Web-based tool for accelerating manual literature curation through the use of advanced text-mining techniques
DR https://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/PubTator/guest2.cgi
//