NAME
vcftools - analyse VCF files
SYNOPSIS
vcftools [OPTIONS]
DESCRIPTION
The vcftools program is run from the command line. The interface is inspired by PLINK,
and so should be largely familiar to users of that package. Commands take the following form:
vcftools --vcf file1.vcf --chr 20 --freq
The above command tells vcftools to read in the file file1.vcf, extract sites on
chromosome 20, and calculate the allele frequency at each site. The resulting allele
frequency estimates are stored in the output file out.frq. As in the above example,
output from vcftools is mainly sent to output files rather than being shown on the
screen.
Note that some commands may only be available in the latest version of vcftools. To obtain
the latest version, use SVN to check out the latest code, as described on the
home page.
Also note that polyploid genotypes are not currently supported.
Basic Options
--vcf
This option defines the VCF file to be processed. The file needs to be decompressed
before use with vcftools. vcftools expects files in VCF format v4.0; see the VCF
specification for details.
--gzvcf
This option can be used in place of the --vcf option to read compressed (gzipped)
VCF files directly. Note that this option can be quite slow when used with large
files.
--out
This option defines the output filename prefix for all files generated by vcftools.
For example, if the prefix is set to output_filename, then all output files will be
of the form output_filename.***. If this option is omitted, all output files will have
the prefix 'out.'.
Site Filter Options
--chr
Only process sites with a matching chromosome identifier.
--from-bp
--to-bp
These options define the physical range of sites that will be processed. Sites outside
of this range are excluded. These options can only be used in conjunction with
--chr.
--snp
Include SNP(s) with a matching ID. This option can be used multiple times in order
to include more than one SNP.
--snps
Include a list of SNPs given in a file. The file should contain a list of SNP IDs,
with one ID per line.
--exclude
Exclude a list of SNPs given in a file. The file should contain a list of SNP IDs,
with one ID per line.
--positions
Include a set of sites on the basis of a list of positions. Each line of the input
file should contain a (tab-separated) chromosome and position. The file should have
a header line. Sites not included in the list are excluded.
--bed
--exclude-bed
Include or exclude a set of sites on the basis of a BED file. Only the first three
columns (chrom, chromStart and chromEnd) are required. The BED file should have
a header line.
--remove-filtered-all
--remove-filtered
--keep-filtered
These options are used to filter sites on the basis of their FILTER flag. The
first option removes all sites with a FILTER flag. The second option can be used to
exclude sites with a specific filter flag. The third option can be used to select
sites on the basis of specific filter flags. The second and third options can be
used multiple times to specify multiple FILTERs. The --keep-filtered option is
applied before the --remove-filtered option.
--minQ
Include only sites with a quality above this threshold.
--min-meanDP
--max-meanDP
Include sites with a mean depth within the thresholds defined by these options.
--maf
--max-maf
Include only sites with a minor allele frequency within the specified range.
--non-ref-af
--max-non-ref-af
Include only sites with a non-reference allele frequency within the specified range.
--hwe
Assess sites for Hardy-Weinberg equilibrium using an exact test, as defined by
Wigginton, Cutler and Abecasis (2005). Sites with a p-value below the threshold
defined by this option are taken to be out of HWE and are therefore excluded.
--geno
Exclude sites on the basis of the proportion of missing data (defined to be between
0 and 1).
--min-alleles
--max-alleles
Include only sites with a number of alleles within the specified range. For
example, to include only bi-allelic sites, one could use:
vcftools --vcf file1.vcf --min-alleles 2 --max-alleles 2
--mask
--invert-mask
--mask-min
Include sites on the basis of a FASTA-like file. The provided file contains a
sequence of integer digits (between 0 and 9) for each position on a chromosome that
specify whether a site at that position should be filtered or not. An example mask
file would look like:
>1
0000011111222 ...
In this example, sites in the VCF file located within the first 5 bases of
chromosome 1 would be kept, whereas sites at position 6 onwards would be
filtered out. The threshold integer that determines whether sites are filtered is
set using the --mask-min option, which defaults to 0. The chromosomes contained in
the mask file must be sorted in the same order as the VCF file. The --mask option
is used to specify the mask file to be used, whereas the --invert-mask option
is used to specify a mask file that will be inverted before being applied.
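The mask rule described above can be sketched in Python. This is only an illustrative reading of the description, not vcftools' own code: the function names parse_mask and keep_site are hypothetical, positions are assumed to be 1-based as in VCF, and a site is assumed to be kept when its mask digit does not exceed the --mask-min threshold.

```python
def parse_mask(text):
    """Parse a FASTA-like mask into {chromosome: string of digits}."""
    masks, chrom = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            chrom = line[1:]
            masks[chrom] = ""
        elif chrom is not None:
            masks[chrom] += line
    return masks


def keep_site(masks, chrom, pos, mask_min=0):
    """Keep a site when its mask digit is <= mask_min (pos is 1-based; assumed rule)."""
    return int(masks[chrom][pos - 1]) <= mask_min


masks = parse_mask(">1\n0000011111222\n")
kept = [pos for pos in range(1, 14) if keep_site(masks, "1", pos)]
print(kept)  # [1, 2, 3, 4, 5]
```

With the default threshold of 0 only the first five positions survive, which matches the example in the text; raising mask_min to 1 in this sketch would also keep positions 6 to 10.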
Individual Filters
--indv
Specify an individual to be kept in the analysis. This option can be used multiple
times to specify multiple individuals.
--keep
Provide a file containing a list of individuals to include in subsequent analysis.
Each individual ID (as defined in the VCF header line) should be included on
a separate line.
--remove-indv
Specify an individual to be removed from the analysis. This option can be used
multiple times to specify multiple individuals. If the --indv option is also
specified, then the --indv option is executed before the --remove-indv option.
--remove
Provide a file containing a list of individuals to exclude from subsequent analysis.
Each individual ID (as defined in the VCF header line) should be included on
a separate line. If both the --keep and the --remove options are used, then
the --keep option is executed before the --remove option.
--min-indv-meanDP
--max-indv-meanDP
Calculate the mean coverage on a per-individual basis. Only individuals with
coverage within the range specified by these options are included in subsequent
analyses.
--mind
Specify the minimum call rate threshold for each individual.
--phased
First excludes all individuals having all genotypes unphased, and subsequently
excludes all sites with unphased genotypes. The remaining data therefore consists
of phased data only.
Genotype Filters
--remove-filtered-geno-all
--remove-filtered-geno
The first option removes all genotypes with a FILTER flag. The second option can
be used to exclude genotypes with a specific filter flag.
--minGQ
Exclude all genotypes with a quality below the threshold specified by this option
(GQ).
--minDP
Exclude all genotypes with a sequencing depth below that specified by this option
(DP).
Output Statistics
--freq
--counts
--freq2
--counts2
Output per-site frequency information. The --freq option outputs the allele frequency
in a file with the suffix '.frq'. The --counts option outputs a similar file with
the suffix '.frq.count' that contains the raw allele counts at each site. The --freq2
and --counts2 options are used to suppress allele information in the output file. In
this case, the order of the frequencies/counts depends on the numbering in the VCF file.
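As a rough illustration of what per-site counts and frequencies mean, the following Python sketch tallies allele indices from VCF GT strings. It is not vcftools' implementation; the helper names allele_counts and allele_freqs are hypothetical, and missing alleles ('.') are assumed to be skipped.

```python
import re
from collections import Counter


def allele_counts(genotypes):
    """Count allele indices in GT strings such as '0/1' or '1|1'; '.' is skipped."""
    counts = Counter()
    for gt in genotypes:
        for allele in re.split(r"[/|]", gt):
            if allele != ".":
                counts[int(allele)] += 1
    return counts


def allele_freqs(genotypes):
    """Return {allele index: frequency}, ordered by the VCF allele numbering."""
    counts = allele_counts(genotypes)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {allele: n / total for allele, n in sorted(counts.items())}


gts = ["0/0", "0/1", "1|1", "./."]
print(dict(allele_counts(gts)))  # {0: 3, 1: 3}
print(allele_freqs(gts))         # {0: 0.5, 1: 0.5}
```

Keying the result by allele index mirrors the remark above that, without allele information, the order follows the numbering in the VCF file.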
--depth
Generates a file containing the mean depth per individual. This file has the suffix
'.idepth'.
--site-depth
--site-mean-depth
Generates a file containing the depth per site. The --site-depth option outputs
the depth for each site summed across individuals. This file has the suffix '.ldepth'.
Likewise, the --site-mean-depth option outputs the mean depth for each site, and
the output file has the suffix '.ldepth.mean'.
--geno-depth
Generates a (possibly very large) file containing the depth for each genotype in the
VCF file. Missing entries are given the value -1. The file has the suffix
'.gdepth'.
--site-quality
Generates a file containing the per-site SNP quality, as found in the QUAL column
of the VCF file. This file has the suffix '.lqual'.
--het
Calculates a measure of heterozygosity on a per-individual basis. Specifically,
the inbreeding coefficient, F, is estimated for each individual using a method of
moments. The resulting file has the suffix '.het'.
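A common method-of-moments form of the inbreeding coefficient is F = (O - E) / (N - E), where O is the observed number of homozygous sites, E the number expected under Hardy-Weinberg proportions, and N the number of sites. The Python sketch below uses that textbook form as an assumption; vcftools may apply additional corrections (for example for sample size), and the function name inbreeding_f is hypothetical.

```python
def inbreeding_f(alt_counts, alt_freqs):
    """Estimate F for one individual.

    alt_counts: non-reference allele count (0, 1 or 2) at each bi-allelic site.
    alt_freqs:  population frequency of the non-reference allele at each site.
    """
    n_sites = len(alt_counts)
    observed_hom = sum(1 for g in alt_counts if g != 1)
    # Expected homozygosity per site under HWE is 1 - 2pq (no small-sample correction).
    expected_hom = sum(1 - 2 * p * (1 - p) for p in alt_freqs)
    return (observed_hom - expected_hom) / (n_sites - expected_hom)


freqs = [0.5, 0.5, 0.5, 0.5]
print(inbreeding_f([0, 2, 0, 2], freqs))  # 1.0  (fully homozygous)
print(inbreeding_f([1, 1, 0, 2], freqs))  # 0.0  (matches HWE expectation)
```

The sketch divides by zero when every site is monomorphic, so real data would need a guard for that case.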
--hardy
Reports a p-value for each site from a Hardy-Weinberg equilibrium test (as defined
by Wigginton, Cutler and Abecasis (2005)). The resulting file (with the suffix '.hwe')
also contains the observed numbers of homozygotes and heterozygotes and the
corresponding expected numbers under HWE.
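The expected numbers under HWE follow from the allele frequency at the site. The Python sketch below shows the standard calculation for a bi-allelic site. It illustrates only the expected counts, not the exact test, and the function name hwe_expected is hypothetical.

```python
def hwe_expected(n_hom_ref, n_het, n_hom_alt):
    """Expected (hom-ref, het, hom-alt) counts under Hardy-Weinberg equilibrium."""
    n = n_hom_ref + n_het + n_hom_alt
    p = (2 * n_hom_ref + n_het) / (2 * n)  # reference allele frequency
    q = 1 - p
    return n * p * p, 2 * n * p * q, n * q * q


expected = hwe_expected(50, 40, 10)
print([round(x, 6) for x in expected])  # [49.0, 42.0, 9.0]
```

Here the observed counts (50, 40, 10) give p = 0.7, so the expected counts are approximately 49, 42 and 9.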
--missing
Generates two files reporting the missingness on a per-individual and per-site
basis. The two files have the suffixes '.imiss' and '.lmiss' respectively.
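To make the two views of missingness concrete, the following Python sketch computes both fractions from a small genotype matrix. It is an illustration under stated assumptions rather than vcftools' code: the function name missingness is hypothetical, the matrix is indexed as matrix[site][individual], and any genotype containing '.' is treated as missing.

```python
def missingness(matrix):
    """Return (per_individual, per_site) missing fractions for GT strings."""
    def is_missing(gt):
        return "." in gt  # assumed definition of a missing genotype

    n_sites, n_indv = len(matrix), len(matrix[0])
    per_site = [sum(is_missing(gt) for gt in row) / n_indv for row in matrix]
    per_indv = [
        sum(is_missing(matrix[s][i]) for s in range(n_sites)) / n_sites
        for i in range(n_indv)
    ]
    return per_indv, per_site


matrix = [
    ["0/0", "./.", "0/1"],  # site 1
    ["0/1", "./.", "1/1"],  # site 2
]
per_indv, per_site = missingness(matrix)
print(per_indv)  # [0.0, 1.0, 0.0]
```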
--hap-r2
--geno-r2
--ld-window
--ld-window-bp
--min-r2
These options are used to report linkage disequilibrium (LD) statistics as
summarised by the r2 statistic. The --hap-r2 option tells vcftools to output a
file reporting the r2 statistic using phased haplotypes. This is the traditional
measure of LD often reported in the population genetics literature. If phased
haplotypes are unavailable, the --geno-r2 option may be used, which calculates the
squared correlation coefficient between genotypes encoded as 0, 1 and 2 to
represent the number of non-reference alleles in each individual. This is the same
as the LD measure reported by PLINK. The haplotype version outputs a file with
the suffix '.hap.ld', whereas the genotype version outputs a file with the suffix
'.geno.ld'. The haplotype version implies the option --phased.
The --ld-window option defines the maximum SNP separation for the calculation of
LD. Likewise, the --ld-window-bp option can be used to define the maximum physical
separation of SNPs included in the LD calculation. Finally, the --min-r2 option sets
a minimum value for r2 below which the LD statistic is not reported.
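The genotype-based r2 is the squared Pearson correlation between two sites' 0/1/2 genotype vectors. The Python sketch below shows that calculation. It is an illustration, not vcftools' code: the function name geno_r2 is hypothetical, and skipping individuals with a missing genotype (coded -1 here) at either site is an assumption.

```python
def geno_r2(x, y):
    """Squared correlation between two genotype vectors coded 0/1/2 (-1 = missing)."""
    pairs = [(a, b) for a, b in zip(x, y) if a >= 0 and b >= 0]
    n = len(pairs)
    if n == 0:
        return float("nan")
    mean_x = sum(a for a, _ in pairs) / n
    mean_y = sum(b for _, b in pairs) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in pairs)
    var_x = sum((a - mean_x) ** 2 for a, _ in pairs)
    var_y = sum((b - mean_y) ** 2 for _, b in pairs)
    if var_x == 0 or var_y == 0:
        return float("nan")  # a monomorphic site has no defined correlation
    return cov * cov / (var_x * var_y)


print(geno_r2([0, 0, 2, 2], [0, 2, 0, 2]))  # 0.0 (independent sites)
print(geno_r2([0, 1, 2, -1], [2, 1, 0, 0]))  # 1.0 (perfectly anti-correlated)
```

Because the statistic is squared, perfectly correlated and perfectly anti-correlated sites both give r2 = 1.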
--SNPdensity
Calculates the number and density of SNPs in bins of the size defined by this option.
The resulting output file has the suffix '.snpden'.
--TsTv
Calculates the transition/transversion ratio in bins of the size defined by this
option. The resulting output file has the suffix '.TsTv'. A summary is also
supplied in a file with the suffix '.TsTv.summary'.
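A transition swaps a purine for a purine (A, G) or a pyrimidine for a pyrimidine (C, T); any other single-base change is a transversion. The Python sketch below counts both per bin. It is illustrative only: the function names are hypothetical, the inputs are assumed to be single-base bi-allelic SNPs, and the binning rule (bin start = position rounded down to a multiple of the bin size) is an assumption rather than vcftools' documented behaviour.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def is_transition(ref, alt):
    """True for purine<->purine or pyrimidine<->pyrimidine substitutions."""
    pair = {ref.upper(), alt.upper()}
    return len(pair) == 2 and (pair <= PURINES or pair <= PYRIMIDINES)


def ts_tv_by_bin(sites, bin_size):
    """Return {bin_start: [n_transitions, n_transversions]} for (pos, ref, alt) SNPs."""
    bins = {}
    for pos, ref, alt in sites:
        start = (pos // bin_size) * bin_size  # assumed binning rule
        counts = bins.setdefault(start, [0, 0])
        counts[0 if is_transition(ref, alt) else 1] += 1
    return bins


sites = [(100, "A", "G"), (150, "C", "T"), (180, "A", "C"), (1200, "G", "T")]
bins = ts_tv_by_bin(sites, 1000)
print(bins)  # {0: [2, 1], 1000: [0, 1]}
```

The Ts/Tv ratio for a bin is then the first count divided by the second, which is undefined when a bin has no transversions.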
--FILTER-summary
Generates a summary of the number of SNPs and Ts/Tv ratio for each FILTER category.
The output file has the suffix '.FILTER.summary'.
--filtered-sites
Creates two files listing sites that have been kept or removed after filtering. The
first file, with the suffix '.kept.sites', lists sites kept by vcftools after filters
have been applied. The second file, with the suffix '.removed.sites', lists sites
removed by the applied filters.
--singletons
This option generates a file detailing the location of singletons and the
individual they occur in. The file reports both true singletons and private
doubletons (i.e. SNPs where the minor allele only occurs in a single individual and
that individual is homozygous for that allele). The output file has the suffix
'.singletons'.
--site-pi
--window-pi
These options are used to estimate levels of nucleotide diversity. The first option
does this on a per-site basis, and the output file has the suffix '.sites.pi'. The
second option calculates the nucleotide diversity in windows, with the window size
defined in the option argument. The output for this option has the suffix
'.windowed.pi'. The windowed version requires phased data, and hence use of this
option implies the --phased option.
Output in Other Formats
--012
This option outputs the genotypes as a large matrix. Three files are produced. The
first, with the suffix '.012', contains the genotypes of each individual on a separate
line. Genotypes are represented as 0, 1 and 2, where the number represents the
number of non-reference alleles. Missing genotypes are represented by -1. The
second file, with the suffix '.012.indv', details the individuals included in the main
file. The third file, with the suffix '.012.pos', details the site locations included in
the main file.
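The 0/1/2 encoding can be sketched in Python as follows. This is an illustration of the encoding described above, not vcftools' implementation; the function names encode_012 and matrix_012 are hypothetical, and treating a genotype with any missing allele as -1 is an assumption.

```python
import re


def encode_012(gt):
    """Number of non-reference alleles in a diploid GT string, or -1 if missing."""
    alleles = re.split(r"[/|]", gt)
    if "." in alleles:
        return -1
    return sum(1 for allele in alleles if allele != "0")


def matrix_012(genotypes_by_site):
    """Turn site-major GT strings into one row of codes per individual."""
    n_indv = len(genotypes_by_site[0])
    return [[encode_012(site[i]) for site in genotypes_by_site] for i in range(n_indv)]


genotypes_by_site = [
    ["0/0", "0/1"],  # site 1: individual A, individual B
    ["1|1", "./."],  # site 2
]
rows = matrix_012(genotypes_by_site)
print(rows)  # [[0, 2], [1, -1]]
```

Each inner list corresponds to one line of the main matrix file: one individual, with one code per site.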
--IMPUTE
This option outputs phased haplotypes in IMPUTE reference-panel format. As IMPUTE
requires phased data, using this option also implies --phased. Unphased
individuals and genotypes are therefore excluded. Only bi-allelic sites are
included in the output. Using this option generates three files. The IMPUTE
haplotype file has the suffix '.impute.hap', and the IMPUTE legend file has the
suffix '.impute.hap.legend'. The third file, with the suffix '.impute.hap.indv',
details the individuals included in the haplotype file, although this file is not
needed by IMPUTE.
--ldhat
--ldhat-geno
These options output data in LDhat format. Use of these options also requires the
--chr option to be used. The --ldhat option outputs phased data only, and
therefore also implies --phased, so unphased individuals and genotypes are
excluded. Alternatively, the --ldhat-geno option treats all of the data as
unphased, and therefore outputs LDhat files in genotype/unphased format. In either
case, two files are generated with the suffixes '.ldhat.sites' and '.ldhat.locs',
which correspond to the LDhat 'sites' and 'locs' input files respectively.
--BEAGLE-GL
This option outputs genotype likelihood information for input into the BEAGLE
program. This option requires the VCF file to contain the FORMAT GL tag, which can
generally be output by SNP callers such as the GATK. Use of this option requires a
chromosome to be specified via the --chr option. The resulting output file (with
the suffix '.BEAGLE.GL') contains genotype likelihoods for bi-allelic sites, and is
suitable for input into BEAGLE via the 'like=' argument.
--plink
This option outputs the genotype data in PLINK PED format. Two files are generated,
with the suffixes '.ped' and '.map'. Note that only bi-allelic loci will be output.
Further details of these files can be found in the PLINK documentation.
Note: This option can be very slow on large datasets. Using the --chr option to
divide up the dataset is advised.
--plink-tped
The --plink option above can be extremely slow on large datasets. An alternative
that might be considerably quicker is to output in the PLINK transposed format.
This can be achieved using the --plink-tped option, which produces two files with
the suffixes '.tped' and '.tfam'.
--recode
The --recode option is used to generate a VCF file from the input VCF file having
applied the options specified by the user. The output file has the suffix
'.recode.vcf'.
By default, the INFO fields are removed from the output file, as the INFO values
may be invalidated by the recoding (e.g. the total depth may need to be
recalculated if individuals are removed). This default functionality can be
overridden by using the --keep-INFO option, which defines the INFO key to keep
in the output file. The --keep-INFO flag can be used multiple times.
Alternatively, the option --keep-INFO-all can be used to retain all INFO
fields.
Miscellaneous
--extract-FORMAT-info
Extract information from the genotype fields in the VCF file relating to a specified
FORMAT identifier. For example, using the option '--extract-FORMAT-info GT' would
extract all of the GT (i.e. Genotype) entries. The resulting output file has the
suffix '.<FORMAT_ID>.FORMAT'.
--get-INFO
This option is used to extract information from the INFO field in the VCF file. The
argument specifies the INFO tag to be extracted, and the option can be used
multiple times in order to extract multiple INFO entries. The resulting file, with
the suffix '.INFO', contains the required INFO information in a tab-separated
table. For example, to extract the NS and DB flags, one would use the command:
vcftools --vcf file1.vcf --get-INFO NS --get-INFO DB
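To show the kind of table such an extraction produces, the Python sketch below pulls selected tags out of VCF INFO strings. It does not reproduce vcftools' exact output: the function names are hypothetical, and the column layout, the '1' written for a present flag, and the '.' written for an absent tag are all assumptions made for illustration. The two sample INFO strings are likewise made up.

```python
def parse_info(info):
    """Split a VCF INFO string into a dict; flags without '=' map to '1' (assumed)."""
    fields = {}
    for entry in info.split(";"):
        if entry and entry != ".":
            key, sep, value = entry.partition("=")
            fields[key] = value if sep else "1"
    return fields


def info_table(records, tags):
    """Build tab-separated rows (header first) for the requested INFO tags."""
    rows = ["\t".join(["CHROM", "POS"] + list(tags))]
    for chrom, pos, info in records:
        fields = parse_info(info)
        values = [fields.get(tag, ".") for tag in tags]  # '.' = tag absent (assumed)
        rows.append("\t".join([chrom, str(pos)] + values))
    return rows


records = [
    ("20", 14370, "NS=3;DP=14;AF=0.5;DB;H2"),
    ("20", 17330, "NS=3;DP=11;AF=0.017"),
]
table = info_table(records, ["NS", "DB"])
print("\n".join(table))
```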
VCF File Comparison Options
The file comparison options are currently in a state of flux and are likely buggy. If
you find a bug, please report it. Note that genotype-level filters are not supported
in these options.
--diff
--gzdiff
Select a VCF file for comparison with the file specified by the --vcf option.
Outputs two files describing the sites and individuals common/unique to each
file. These files have the suffixes '.diff.sites_in_files' and
'.diff.indv_in_files' respectively. The --gzdiff version can be used to read
compressed VCF files.
--diff-site-discordance
Used in conjunction with the --diff option to calculate discordance on a site-by-site
basis. The resulting output file has the suffix '.diff.sites'.
--diff-indv-discordance
Used in conjunction with the --diff option to calculate discordance on a
per-individual basis. The resulting output file has the suffix '.diff.indv'.
--diff-discordance-matrix
Used in conjunction with the --diff option to calculate a discordance matrix. This
option only works with bi-allelic loci with matching alleles that are present in
both files. The resulting output file has the suffix '.diff.discordance.matrix'.
--diff-switch-error
Used in conjunction with the --diff option to calculate phasing errors
(specifically 'switch errors'). This option generates two output files describing
switch errors found between sites, and the average switch error per individual.
These two files have the suffixes '.diff.switch' and '.diff.indv.switch'
respectively.
Options still in development
The following options are yet to be finalised, are likely to contain bugs, and are
likely to change in the future.
--fst
--gzfst
Calculate FST for a pair of VCF files, with the second file being specified by this
option. FST is currently calculated using the formula described in the
supplementary material of the Phase I HapMap paper. Currently, only pairwise FST
calculations are supported, although this will likely change in the future. The
--gzfst option can be used to read compressed VCF files.
--LROH
Identify long runs of homozygosity.
--relatedness
Output individual relatedness statistics.