-
Notifications
You must be signed in to change notification settings - Fork 489
Add kMetaShot and DM for kMetaShot #7421
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
tools/kmetashot/kmetashot.xml
Outdated
| </options> | ||
| <validator type="no_options" message="No reference data for kMetaShot is installed. Please contact the Galaxy adminstrators to request one be installed."/> | ||
| </param> | ||
| <param argument="--ass2ref" type="float" min="0.0" value="0.0" max="1.0" label="Set ass2ref parameter" help="Ass2ref is a ratio between the number of MAG minimizers and the reference minimizers related to the assigned strain"/> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is is possible to get a better explanation on this ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
cfa9bf1 should be fine now?
tools/kmetashot/kmetashot.xml
Outdated
|
|
||
| **Output** | ||
|
|
||
| The Output is a collection with csv file(s) which contained the classification for the inputted bin(s)/MAG(s) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It would be good if we could get our hands on one of these outputs to see the actual content, is it taxonomy per contig or per bin ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
e10b88e is this fine in this case?
| <command detect_errors="exit_code"><![CDATA[ | ||
| mkdir -p '$out_file.extra_files_path' && | ||
| #if $test != "true" | ||
| wget 'https://zenodo.org/records/17375120/files/kMetaShot_bacteria_archaea_2025-05-22.h5' && |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
since there seem to be stable releases:
https://github.com/gdefazio/kMetaShot?tab=readme-ov-file#kmetashot-reference
I think it would be better if the admins could choose between those, that's better for reproducability
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tools/kmetashot/kmetashot.xml
Outdated
| <![CDATA[ | ||
|
|
||
| This tool is a taxonomic classifier which used a new algorithm which is alignment free. | ||
| The data from the input files will be transformed into bites and each base is represented by 2 bites. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
L70 - 77 I think is not needed, we can just point to the github repo
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
0acaf08 you mean like this?
|
@SaimMomin12 @bgruening this tool requires a large DB (~22 GB) and also a lot of RAM, the devs confirm that it's technically not doable to shrink it: gdefazio/kMetaShot#7 |
|
Yes, please move, make sure the tests are working locally and then comment them out with a comment above the test. |
i can write test which would work but the Problem is that you need at least 16 GiB of RAM which my PC doesnt have so i dont know if i can run them locally |
let me try ... |
Thank you when i did work send me the command line and the output so i can adapte thr wrapper to it. I will fix the review you made for the DM first then! |
Co-authored-by: paulzierep <paul.zierep@googlemail.com>
Co-authored-by: paulzierep <paul.zierep@googlemail.com>
Co-authored-by: paulzierep <paul.zierep@googlemail.com>
…-iuc into add_kMetaShot
|
@paulzierep can you check it again please if the changes i did was correct? The only thing i didnt add was the test for a possible local run |
| </requirements> | ||
| <command detect_errors="exit_code"> | ||
| <![CDATA[ | ||
| python '$__tool_directory__/kmetashot_database_installer.py' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please do not add new data managers using the python wrapper. Just follow the scheme implemented for instance here https://github.com/galaxyproject/tools-iuc/blob/main/data_managers/data_manager_checkm2/data_manager/checkm2_datamanager.xml
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you work with this also having multiple version of the data you download or only if you have 1 data which you download evreytime?
I will change it back of course since i had it first like this but the admin should have the option to download different Version.
Also the tool itself doesnt do the download like checkm2. We have to use wget in this case
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you work with this also having multiple version of the data you download or only if you have 1 data which you download evreytime?
I will change it back of course since i had it first like this but the admin should have the option to download different Version.
There is no difference. The point is that you can implement the same actions that are currently implemented in the python wrapper in the command block:
- call wget
- create json file
Also the tool itself doesnt do the download like checkm2. We have to use wget in this case
Comment
Also wget is just a command line tool that we can call directly from the command block and there is not need for a python wrapper that just calls it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah okay good to know. I didnt know this so i wasnt sure if this work in this case. Thank you for answering my question!
I think this should be good or not? c22e642
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes this looks good.
…-iuc into add_kMetaShot
FOR CONTRIBUTOR: