在處理fasta檔案的時候我們常常會遇到需要合併多個fasta檔案或者分隔單個fasta檔案成為每個序列一個檔案。這篇文章會告訴你如何做。

合併

Windows系統下,開啟cmd系統。(ctrl+win)

使用cd轉換地址到你需要處理的地方

type *。txt>>all。txt

例如:type A。txt B。txt > AB。txt

2. 分隔

方法一:awk

awk

‘/^>/{s=++num}{print > “example_”s“。fa”}’

example

fasta

方法二:python 指令碼

import

sys

def

parseFasta

filename

):

fas

=

{}

id

=

None

with

open

filename

‘r’

as

fh

for

line

in

fh

if

line

0

==

‘>’

header

=

line

1

:]

rstrip

()

id

=

header

split

()[

0

fas

id

=

[]

else

fas

id

append

line

rstrip

())

for

id

seq

in

fas

items

():

fas

id

=

‘’

join

seq

return

fas

fas

=

parseFasta

sys

argv

1

])

for

i

in

fas

with

open

i

+

“。fa”

‘w’

as

f

f

write

“>”

+

i

+

\n

+

fas

i

+

\n

執行方法:

在cmd下:

python splitFasta。py AB。fasta

你就會得到A。fasta和B。fasta