Update Semibin 2.0.2 to 2.2.0 #7347

SantaMcCloud · 2025-10-08T10:23:41Z

FOR CONTRIBUTOR:

I have read the CONTRIBUTING.md document and this tool is appropriate for the tools-iuc repo.
License permits unrestricted use (educational + commercial)
This PR adds a new tool or tool collection
This PR updates an existing tool or tool collection
This PR does something else (explain below)

SantaMcCloud · 2025-10-08T10:25:21Z

Test will fail because a datatype is still missing: galaxyproject/galaxy#21024
Also i did comment out a test since i think the tool can not detect anything to make this test work with the amount of data got with the test but im not sure in this case.

SantaMcCloud · 2025-10-08T13:00:11Z

The file model.pt now the question could this be slip such that even it is bigger then 1 MB it will be approved or should i remove it and also remove all the test using this as input to test certain functionality?

SaimMomin12 · 2025-10-08T13:18:29Z

The file model.pt now the question could this be slip such that even it is bigger then 1 MB it will be approved or should i remove it and also remove all the test using this as input to test certain functionality?

For file sizes >1 MB, you can consider uploading it to Zenodo and using the link in the test

SantaMcCloud · 2025-10-09T07:36:02Z

The file model.pt now the question could this be slip such that even it is bigger then 1 MB it will be approved or should i remove it and also remove all the test using this as input to test certain functionality?

For file sizes >1 MB, you can consider uploading it to Zenodo and using the link in the test

Ah okay thank you i didnt know i will do it this way then!

…-iuc into semibin_update

SantaMcCloud · 2025-10-09T07:54:58Z

#7344

SantaMcCloud · 2025-10-09T07:55:01Z

#7326

bgruening · 2025-10-09T16:18:32Z

tools/semibin/bin.xml

@@ -44,7 +44,7 @@ SemiBin2 bin
                <expand macro="environment"/>
            </when>
            <when value="history">
-                <param argument="--model" type="data" format="h5" label="Trained semi-supervised deep learning model"/>
+                <param argument="--model" type="data" format="pt" label="Trained semi-supervised deep learning model"/>


@SantaMcCloud do you know if we can use the safetensor datatype here? Or that this tool can consume safetensor datatypes?

Im not sure i didnt see anything about this in the documentation. This might work with the old file too but i have to look at it since certain packages has to be downgraded in this case.

I will test it first if this works with safetensor or not and will let you know if it works or not!

@bgruening i try to run sembin2 with safetensor but it looks like it can not be used. The only option here is to run a convert befor and after when a .pt file is used/outputed somewhere. My question now is it is possible to write a short python script in the commnad line of the wrapper to start torch and tranform it to safetensor?

With this it is possible to ouput the model created in the sembin_train will be safetensor. For semibin_bin the convert to safetensor to .pt should be also doable so semibin can run without problem and the .pt file can be deleted via commnad line.

From the Galaxy side that is possible, yes. You can include a small auxiliary Python script with the tool and then do the conversion.

If is possible from the semibin side, e.g. that no data gets lost during the transformation, I do not know :(

I can not tell it either i can try to have some runs with the test data and see if there is any problem with it. and will give you feedback then.

For the script is there any source or exmaple for it so i can have a look into it?

@bgruening so on the first look the little script i wrote for the specific usecase for semibin to convert the .pt file to safetensor and back works. both result with the testdata are the same with the original model and the converted model. So this should work as workaround! :)

Nice! This maybe helps

tools-iuc/tools/macs2/macs2_callpeak.xml

Line 127 in 848922d

python '$__tool_directory__/dir2html.py'

its just an additional script that you call around the original command.
You can put many commands in the command section, just separate them by &&

I did add the script and i hope now that it works with it, i also open an issue to see if the upstream devs might change it in the futher

File "/home/runner/work/tools-iuc/tools-iuc/tools/semibin/convert.py", line 4, in <module> from safetensors.torch import save_file, load_file ModuleNotFoundError: No module named 'safetensors'

based on this i should rewrote the recipe and include the safetensors package? @bgruening

Update Sembin 2.0.2 to 2.2.0

046c27e

This was referenced Oct 8, 2025

Add new dataype 'pt' galaxyproject/galaxy#21024

Closed

Updating tools/semibin from version 2.0.2 to 2.2.0 #5823

Open

Update generate_cannot_links.xml

2d2ba0d

SantaMcCloud added 2 commits October 9, 2025 09:52

upload model.pt to zenodo

eb55769

erge branch 'semibin_update' of https://github.com/SantaMcCloud/tools…

1c54089

…-iuc into semibin_update

bgruening reviewed Oct 9, 2025

View reviewed changes

bernt-matthias changed the title ~~Update Sembin 2.0.2 to 2.2.0~~ Update Semibin 2.0.2 to 2.2.0 Oct 14, 2025

SantaMcCloud added 5 commits October 17, 2025 00:48

add converter for .pt to safetensors

6c50a9f

swap pt to savetensors

95b84e0

fix regex

b100b63

change test link

215d78d

fix linting

b46e78e

bernt-matthias mentioned this pull request Oct 17, 2025

Semibin2 semibin_concatenate_fasta fails due to file names #7326

Open

SantaMcCloud and others added 7 commits October 18, 2025 00:10

fix some more linting and a typo

0e9b55d

fix one error and linting

942d1be

final fix

dc4e97f

fix format

a064da9

fix import order

9776fc8

add fix of different PR

d95b52c

typo

9f02f6a

Uh oh!

Update Semibin 2.0.2 to 2.2.0 #7347

Are you sure you want to change the base?

Update Semibin 2.0.2 to 2.2.0 #7347

Uh oh!

Conversation

SantaMcCloud commented Oct 8, 2025

Uh oh!

SantaMcCloud commented Oct 8, 2025

Uh oh!

SantaMcCloud commented Oct 8, 2025

Uh oh!

SaimMomin12 commented Oct 8, 2025

Uh oh!

SantaMcCloud commented Oct 9, 2025

Uh oh!

SantaMcCloud commented Oct 9, 2025

Uh oh!

SantaMcCloud commented Oct 9, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SantaMcCloud Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SantaMcCloud Oct 16, 2025 •

edited

Loading