The data is structured on both the TERRA-REF strorage (accessible via Globus and Workbench) and CyVerse Data Store infrastructures as follows:
1
|-terraref
2
| |-genomics
3
| | |-raw_data
4
| | | |-bap
5
| | | | |-resequencing
6
| | | |-ril
7
| | | | |-gbs
8
| | |-derived_data
9
| | | |-bap
10
| | | | |-resequencing
11
| | | | | |-danforth_center
12
| | | |-ril
13
| | | | |-gbs
14
| | | | | |-kansas_state
Copied!
Whole-genome resequencing
Raw data
Raw data are in bzip2 FASTQ format, one per read pair (*_R1.fastq.bz2 and *_R2.fastq.bz2). 384 samples are available. For a list of the lines sequenced, see the sample table.
Derived data
Data derived from analysis of the raw resequencing data at the Danforth Center (version1) are available as gzipped, genotyped variant call format (gVCF) files and the final combined hapmap file.
Genotyping-by-sequencing (GBS)
Raw data
Raw data are in gzip FASTQ format. 768 samples are available. For a list of lines sequenced, see the sample table.
Derived data
Combined genotype calls are available in VCF format.