Skip to content

Conversation

@SantaMcCloud
Copy link
Contributor

FOR CONTRIBUTOR:

  • I have read the CONTRIBUTING.md document and this tool is appropriate for the tools-iuc repo.
  • License permits unrestricted use (educational + commercial)
  • This PR adds a new tool or tool collection
  • This PR updates an existing tool or tool collection
  • This PR does something else (explain below)

</options>
<validator type="no_options" message="No reference data for kMetaShot is installed. Please contact the Galaxy adminstrators to request one be installed."/>
</param>
<param argument="--ass2ref" type="float" min="0.0" value="0.0" max="1.0" label="Set ass2ref parameter" help="Ass2ref is a ratio between the number of MAG minimizers and the reference minimizers related to the assigned strain"/>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is is possible to get a better explanation on this ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cfa9bf1 should be fine now?


**Output**

The Output is a collection with csv file(s) which contained the classification for the inputted bin(s)/MAG(s)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be good if we could get our hands on one of these outputs to see the actual content, is it taxonomy per contig or per bin ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

e10b88e is this fine in this case?

<command detect_errors="exit_code"><![CDATA[
mkdir -p '$out_file.extra_files_path' &&
#if $test != "true"
wget 'https://zenodo.org/records/17375120/files/kMetaShot_bacteria_archaea_2025-05-22.h5' &&
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

since there seem to be stable releases:
https://github.com/gdefazio/kMetaShot?tab=readme-ov-file#kmetashot-reference
I think it would be better if the admins could choose between those, that's better for reproducability

Copy link
Contributor Author

@SantaMcCloud SantaMcCloud Nov 4, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

0acaf08 4d0b6eb i did make the DM dynamic so each new version can be added there and can be downloaded

<![CDATA[

This tool is a taxonomic classifier which used a new algorithm which is alignment free.
The data from the input files will be transformed into bites and each base is represented by 2 bites.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

L70 - 77 I think is not needed, we can just point to the github repo

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

0acaf08 you mean like this?

@paulzierep
Copy link
Contributor

@SaimMomin12 @bgruening this tool requires a large DB (~22 GB) and also a lot of RAM, the devs confirm that it's technically not doable to shrink it: gdefazio/kMetaShot#7
Any idea how we could get it in ? Should we move it to @bgruening repo?

@bgruening
Copy link
Member

Yes, please move, make sure the tests are working locally and then comment them out with a comment above the test.

@SantaMcCloud
Copy link
Contributor Author

Yes, please move, make sure the tests are working locally and then comment them out with a comment above the test.

i can write test which would work but the Problem is that you need at least 16 GiB of RAM which my PC doesnt have so i dont know if i can run them locally

@paulzierep
Copy link
Contributor

Yes, please move, make sure the tests are working locally and then comment them out with a comment above the test.

i can write test which would work but the Problem is that you need at least 16 GiB of RAM which my PC doesnt have so i dont know if i can run them locally

let me try ...

@SantaMcCloud
Copy link
Contributor Author

Yes, please move, make sure the tests are working locally and then comment them out with a comment above the test.

i can write test which would work but the Problem is that you need at least 16 GiB of RAM which my PC doesnt have so i dont know if i can run them locally

let me try ...

Thank you when i did work send me the command line and the output so i can adapte thr wrapper to it. I will fix the review you made for the DM first then!

@SantaMcCloud
Copy link
Contributor Author

@paulzierep can you check it again please if the changes i did was correct? The only thing i didnt add was the test for a possible local run

</requirements>
<command detect_errors="exit_code">
<![CDATA[
python '$__tool_directory__/kmetashot_database_installer.py'
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please do not add new data managers using the python wrapper. Just follow the scheme implemented for instance here https://github.com/galaxyproject/tools-iuc/blob/main/data_managers/data_manager_checkm2/data_manager/checkm2_datamanager.xml

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you work with this also having multiple version of the data you download or only if you have 1 data which you download evreytime?

I will change it back of course since i had it first like this but the admin should have the option to download different Version.

Also the tool itself doesnt do the download like checkm2. We have to use wget in this case

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you work with this also having multiple version of the data you download or only if you have 1 data which you download evreytime?

I will change it back of course since i had it first like this but the admin should have the option to download different Version.

There is no difference. The point is that you can implement the same actions that are currently implemented in the python wrapper in the command block:

  1. call wget
  2. create json file

Also the tool itself doesnt do the download like checkm2. We have to use wget in this case
Comment

Also wget is just a command line tool that we can call directly from the command block and there is not need for a python wrapper that just calls it.

Copy link
Contributor Author

@SantaMcCloud SantaMcCloud Nov 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah okay good to know. I didnt know this so i wasnt sure if this work in this case. Thank you for answering my question!

I think this should be good or not? c22e642

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes this looks good.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants