Skip to content

Assembly workflow#26

Merged
ochkalova merged 41 commits intodevfrom
assembly_workflow
Mar 9, 2026
Merged

Assembly workflow#26
ochkalova merged 41 commits intodevfrom
assembly_workflow

Conversation

@KateSakharova
Copy link
Contributor

@KateSakharova KateSakharova commented Nov 11, 2025

Resolving #4

This pull request introduces support for a new "metagenomic_assemblies" submission mode to the pipeline, alongside significant improvements to documentation and input schema validation.

Major new features and improvements:

Support for metagenomic assemblies:

  • Added the metagenomic_assemblies mode, routed to a new ASSEMBLYSUBMIT workflow, with its own input schema (assets/schema_input_assembly.json) and example samplesheet (assets/samplesheet_assembly.csv). The schema enforces required fields and validates that either coverage or reads are provided for each assembly.
  • Updated the main workflow (main.nf) to dispatch to GENOMESUBMIT or ASSEMBLYSUBMIT based on the --mode parameter, and to handle multiqc report output accordingly.

Documentation and usability:

  • Rewrote and expanded the README.md to clearly describe the three supported submission modes, required/optional input columns, command-line usage, and output structure. Added example samplesheets and clarified parameter documentation.

Input validation and schemas:

  • Renamed the genome submission input schema to schema_input_genome.json
  • Added new schema schema_input_assembly.json

Configuration and output structure:

  • Output directories for manifests, webin_cli reports, and multiqc reports are now grouped by mode (e.g., outdir/mags/upload/manifests) for better separation of results.
  • Added dedicated test config files for genome and assembly workflows. FIXME: test.config for now left empty.

Required improvements:
**- Schema mentions are removed from nextflow_schema.json, to enable usage of --input for different samplesheets, that enables automatic validation. validateSamplesheet function has to be added to pipeline inizialisation.

  • Tests on GitHub CI fail due to "missing secrets" error.**

PR checklist

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
  • If you've added a new tool - have you followed the pipeline conventions in the contribution docs
  • If necessary, also make a PR on the nf-core/seqsubmit branch on the nf-core/test-datasets repository.
  • Make sure your code lints (nf-core pipelines lint).
  • Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
  • Check for unexpected warnings in debug mode (nextflow run . -profile debug,test,docker --outdir <OUTDIR>).
  • Usage Documentation in docs/usage.md is updated.
  • Output Documentation in docs/output.md is updated.
  • CHANGELOG.md is updated.
  • README.md is updated (including new tool citations and authors/contributors).

@github-actions
Copy link

github-actions bot commented Nov 11, 2025

nf-core pipelines lint overall result: Passed ✅ ⚠️

Posted for pipeline commit bd9a67f

+| ✅ 200 tests passed       |+
#| ❔   4 tests were ignored |#
#| ❔   1 tests had warnings |#
!| ❗  23 tests had warnings |!
Details

❗ Test warnings:

  • readme - README contains the placeholder zenodo.XXXXXXX. This should be replaced with the zenodo doi (after the first release).
  • pipeline_todos - TODO string in README.md: Add citation for pipeline after first release. Uncomment lines below and update Zenodo doi and badge at the top of this file.
  • pipeline_todos - TODO string in README.md: Add bibliography of tools and data used in your pipeline
  • pipeline_todos - TODO string in nextflow.config: Optionally, you can add a pipeline-specific nf-core config at https://github.com/nf-core/configs
  • pipeline_todos - TODO string in nextflow.config: Update the field with the details of the contributors to your pipeline. New with Nextflow version 24.10.0
  • pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your preferred methods description, e.g. add publication citation for this pipeline
  • pipeline_todos - TODO string in nextflow.config: Specify any additional parameters here
  • pipeline_todos - TODO string in usage.md: Add documentation about anything specific to running your pipeline. For general topics, please point to (and add to) the main nf-core website.
  • pipeline_todos - TODO string in output.md: Write this documentation describing your workflow's output
  • pipeline_todos - TODO string in awsfulltest.yml: You can customise AWS full pipeline tests as required
  • pipeline_todos - TODO string in base.config: Check the defaults for all processes
  • pipeline_todos - TODO string in base.config: Customise requirements for specific processes.
  • pipeline_todos - TODO string in test_assembly.config: Specify the paths to your test data on nf-core/test-datasets
  • pipeline_todos - TODO string in test_assembly.config: Give any required params for the test so that command line flags are not needed
  • pipeline_todos - TODO string in test_genome.config: Specify the paths to your test data on nf-core/test-datasets
  • pipeline_todos - TODO string in test_genome.config: Give any required params for the test so that command line flags are not needed
  • pipeline_todos - TODO string in test_full.config: Specify the paths to your full test data ( on nf-core/test-datasets or directly in repositories, e.g. SRA)
  • pipeline_todos - TODO string in test_full.config: Give any required params for the test so that command line flags are not needed
  • pipeline_todos - TODO string in environment.yml: List required Conda package(s).
  • pipeline_todos - TODO string in main.nf.test: Once you have added the required tests, please run the following command to build this file:
  • pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
  • pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
  • pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!

❔ Tests ignored:

❔ Tests fixed:

✅ Tests passed:

Run details

  • nf-core/tools version 3.5.1
  • Run at 2026-03-09 12:01:30

@nf-core-bot
Copy link
Member

nf-core-bot commented Mar 5, 2026

Warning

Newer version of the nf-core template is available.

Your pipeline is using an old version of the nf-core template: 3.5.1.
Please update your pipeline to the latest version.

For more documentation on how to update your pipeline, please see the nf-core documentation and Synchronisation documentation.

@ochkalova ochkalova force-pushed the assembly_workflow branch from e09443f to 26c6296 Compare March 6, 2026 11:54
@ochkalova ochkalova self-requested a review March 9, 2026 16:22
Copy link
Contributor

@ochkalova ochkalova left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are still some things to fix, but it starts to look like a pipeline 😄

@ochkalova ochkalova marked this pull request as ready for review March 9, 2026 16:24
@ochkalova ochkalova merged commit da12288 into dev Mar 9, 2026
11 of 17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants