有时候,会获得bed文件,bed文件是染色体的位置信息。
当我们想得到对应的fasta序列是,需要用到的程序是bedtools中的
getfasta
当我们想得到对应的fasta序列是,需要用到的程序是bedtools中的
getfasta
getfast的原文地址http://bedtools.readthedocs.org/en/latest/content/tools/getfasta.html
很容易理解。
bedtools getfasta [OPTIONS] -fi <input FASTA> -bed <BED/GFF/VCF> -fo <output FASTA>
<input FASTA> 就是参考序列,比如hg19.fa <BED/GFF/VCF> 是你的bed文件
<output FASTA> 是你输出的文件的名字
[OPTIONS] 为可调节选项,功能如下:
-name
Use the “name” column in the BED file for the FASTA headers in the output FASTA file.
-tab
Report extract sequences in a tab-delimited format instead of in FASTA format.
-s
Force strandedness. If the feature occupies the antisense strand, the sequence will be reverse complemented. Default: strand information is ignored.
-split
Given BED12 input, extract and concatenate the sequences from the BED “blocks” (e.g., exons)
很容易理解。
bedtools getfasta [OPTIONS] -fi <input FASTA> -bed <BED/GFF/VCF> -fo <output FASTA>
<input FASTA> 就是参考序列,比如hg19.fa <BED/GFF/VCF> 是你的bed文件
<output FASTA> 是你输出的文件的名字
[OPTIONS] 为可调节选项,功能如下:
-name
Use the “name” column in the BED file for the FASTA headers in the output FASTA file.
-tab
Report extract sequences in a tab-delimited format instead of in FASTA format.
-s
Force strandedness. If the feature occupies the antisense strand, the sequence will be reverse complemented. Default: strand information is ignored.
-split
Given BED12 input, extract and concatenate the sequences from the BED “blocks” (e.g., exons)