[bioinfo]fastqファイルを分割する
fastqファイルを分割するのはseqkitで下記のコマンドで行えるが、
seqkit split <fastq_file>-i, --by-id split squences according to sequence ID-p, --by-part int split sequences into N parts-r, --by-region string split squences according to subsequence of given region. e.g 1:12 for first 12 bases, -12:-1 for last 12 bases. type "seqkit split -h" for more examples-s, --by-size int split sequences into multi parts with N sequences-d, --dry-run dry run, just print message and no files will be created.-f, --force overwrite output directory-h, --help help for split-k, --keep-temp keep tempory FASTA and .fai file when using 2-pass mode-O, --out-dir string output directory (default value is $infile.split)-2, --two-pass two-pass mode read files twice to lower memory usage. (only for FASTA format)
ファイルが大きすぎるとRAMが足りずに落ちる。その場合は、FASTQ Splitterを用いる。
Usage: fastq-splitter.pl [options] <file>...
Options:
--n-parts - Divide into parts
--part-size - Divide into parts of size
--measure (all|seq|count) - Specify whether all data, sequence length, or
number of sequences is used for determining part
sizes ('all' by default).
--eol (dos|mac|unix) - Choose end-of-line character ('unix' by default).
--check - Check FASTQ correctness.
--version - Show version.
--help - Show help.
