Skip to content

mmseqs splitdb does not preserve annotation of originating fasta files #376

@grst

Description

@grst

Expected Behavior

I use splitdb to run mmseqs search in parallel on a HPC (SGE).
For each match, I would like to retrieve the name of the original fasta file with mmseqs convertalis --format-output "...,qset,tset,...".

Current Behavior

Specifying qset or tset leads to a segmentation fault. Running search and convertalis on the full db works without issues.

Steps to Reproduce (for bugs)

mmseqs createdb test1.faa test2.faa db
mmseqs splitdb db db_split --split 2
for file in db_split_*_2; do
  mmseqs createsubdb ${file}.index db_h ${file}_h
done
mmseqs search db_split_1_2 db resultdb tmp
mmseqs convertalis db_split_1_2 db resultdb results.tsv --format-output "query,target,qset"

MMseqs Output (for bugs)

convertalis db_split_1_2 db resultdb results.tsv --format-output query,target,qset 

MMseqs Version:         45c4de7f1daefa06b45688195305eadedaea4d97
Substitution matrix     nucl:nucleotide.out,aa:blosum62.out
Alignment format        0
Format alignment output query,target,qset
Translation table       1
Gap open cost           nucl:5,aa:11
Gap extension cost      nucl:2,aa:1
Database output         false
Preload mode            0
Search type             0
Threads                 64
Compressed              0
Verbosity               3

repex.sh: line 7: 43190 Segmentation fault      (core dumped) mmseqs convertalis db_split_1_2 db resultdb results.tsv --format-output "query,target,qset"

Your Environment

Include as many relevant details about the environment you experienced the bug in.

  • Git commit used (The string after "MMseqs Version:" when you execute MMseqs without any parameters): 45c4de7f1daefa06b45688195305eadedaea4d97
  • Which MMseqs version was used (Statically-compiled, self-compiled, Homebrew, etc.): statically compiled
  • Server specifications (especially CPU support for AVX2/SSE and amount of system memory): Intel(R) Xeon(R) CPU E7-4850 v4 @ 2.10GHz, AVX2 support, 3TB RAM
  • Operating system and version:CentOS Linux 7 64bit / Linux 3.10.0-1127.13.1.el7.x86_64

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions