News
Subscribe to the Australian BioCommons monthly newsletter or read previous editions
Knowledge sharing enABLeS computational research across the life sciences
The ABLeS User Group Meetings provide opportunities to come together to share knowledge, offer tips, ask questions or give feedback.
ABLeS (Australian BioCommons Leadership Share) supports groups with their shared data-driven bioinformatics needs, who bring with them a wide range of computational research programs. The research groups benefiting from the program wouldn’t normally have an avenue to share insights and learn from each other’s experiences.
The ABLeS program now includes more than 250 registered participants, representing more than 50 Australian research institutions and a wonderfully diverse array of research projects. The ABLeS User Group Meeting offers opportunities to come together to share knowledge, offer tips, ask questions or give feedback.
Held twice a year, the ABLeS User Group meeting includes presentations from researchers about how they are making use of their allocated computational resources, alongside updates on pertinent developments at both of the Australian peak computational facilities, Pawsey and NCI. This online forum is also a chance to ask questions or provide feedback on how ABLeS can be improved to help different researcher needs.
A poll at the latest User Group meeting suggested an octopus mascot for ABLeS!
At the last meeting, users heard from three different researchers whose projects receive ABLeS resources:
Computational Structural Biology Node, Dr Keiran Rowell (UNSW)
The Effect of a Mediterranean Diet on Blood DNA Methylation Profiles in Pregnant Women, Grace Tavelli (The Kids)
Rare Disease Genetics and Functional Genomics, Assoc Prof Gina Ravenscroft (Perkins Institute)
Given that it was the inaugural ABLeS User Group Meeting, the event opened with a fun activity to find a mascot for the ABLeS service. The most popular option amongst the 39 people attending was a friendly octopus. The ABLeS team is still trying to decipher any hidden message in that choice! Other interesting results from the poll were that the majority of participants joined from NSW and WA, and two thirds of those users were working on non-human species.
The next ABLeS User Group Meeting will take place online on 2 June 2025. Registered users of ABLeS should keep an eye on their inbox for their invitation to attend.
If you aren’t yet registered for ABLeS, you can find out more in this 10 min video explainer or jump straight into the ABLeS documentation.
ABLeS (Australian BioCommons Leadership Share) was established in 2021 to support data-driven bioinformatics. Australian BioCommons partners with Bioplatforms Australia, National Computational Infrastructure (NCI), and Pawsey Supercomputing Research Centre (Pawsey).
Galaxy Australia’s 10 millionth job relied on popular locally-developed tool
The ten millionth job completed on the Galaxy Australia service analysed pathogen samples for known antibiotic resistance genes using the ABRicate tool. You’d be surprised how many time this Australian tool gets used.
At any one time, there is a wide array of research being done using Galaxy Australia. When the ten millionth job was submitted to the service recently, it illuminated the importance of good bioinformatics software. The particular job was one of a bunch that analysed various pathogen samples for known antibiotic resistance genes.
Over the last 7 years, more than 40,000 users representing researchers, students, trainers and trainees have demonstrated the platform's versatility across a wide range of fields. This milestone job investigating antimicrobial resistance employed the ABRicate tool, authored by Australian bioinformatician Torsten Seemann. Further investigation showed that 490,507 jobs run on Galaxy Australia also used ABRicate in their workflows!
The University of Melbourne’s A/Prof Torsten Seemann is the Lead Bioinformatician at the Centre for Pathogen Genomics and the Director of Bioinformatics at Doherty Applied Microbial Genomics. As a world-renowned bioinformatician he has developed cutting edge analysis approaches to enhance the use of genomic data. Torsten has authored many popular software tools for the analysis of bacterial sequence data, particularly in the fields of public health and clinical microbiology. He is also a strong advocate for open-source software and open science.
He created ABRicate for the mass screening of assemblies for antimicrobial resistance and virulence genes. Given a FASTA contig file or a genbank file, ABRicate will perform a mass screening of contigs and identify the presence of antibiotic resistance genes. The user can choose which database to search from a list of available antimicrobial resistance databases.
The name ‘ABRicate’ was chosen as the first 3 letters are a common acronym for ‘Anti-Biotic Resistance.’ The word also has the form of an English verb, which suggests the tool is actually taking action against the problem of antibiotic resistance, and it is uniqueit unique enough to be easily searchable.
The tool is widely available to anyone who needs it, and can be found in many repositories. It is one of over 10,000 tools available in the Galaxy Toolshed. There is a tutorial available on the Galaxy Training Network which supports the use of this tool: Pathogen detection from (direct Nanopore) sequencing data using Galaxy - Foodborne Edition.
Torsten was recently awarded The Chan Zuckerberg Initiative’s Essential Open Source Software for Science grant for the development of another tool, Snippy. This open-source software tool was written to rapidly identify genetic differences between bacterial genome sequences, and work is underway to allow Snippy to support the latest genome sequencing technologies and viral and fungal pathogens too.
In recognition of the ongoing important contributions that Torsten makes, an award was established in his name. The annual ABABCS ‘Torsten Seemann Outstanding Bioinformatics Software Developer Award’ (previously sponsored by the ARDC) recognises an outstanding EMCR bioinformatic software developer from the Australian community with a view to promoting further efforts to develop and share bioinformatics methodologies.
Are your essential tools available on Galaxy Australia? Request an installation by the Galaxy Australia team if there’s a tool you think should be made available for everyone to use.
Nominate your favourite tool’s author for the Torsten Seemann Outstanding Bioinformatics Software Developer Award.
Partners meet to start building the digital infrastructure needed for human genomics research though GUARDIANS
Australian researchers are set to make significant strides in discovering, accessing, and analysing human genomics data. The first in-person meeting of a significant new collaboration brought together partners who are committed to implementing the Australian BioCommons’ GUARDIANS program.
A diverse group came together, including special guest Dr Melissa Konopko from ELIXIR who is standing here with BioCommons’ Prof Bernie Pope.
Australian researchers are set to make significant strides in discovering, accessing, and analysing human genomics data. The first in-person meeting of a significant new collaboration brought together partners who are committed to implementing the Australian BioCommons’ GUARDIANS program.
The meeting in Sydney represented the start of two years of implementation work as part of the GUARDIANS mission to empower Australian researchers to easily and securely discover, access, analyse and use human genomics data across national infrastructure, using the latest tools and resources.
The project brings Australian BioCommons together with Australian Access Federation, Children's Cancer Institute / ZERO, Garvan Institute of Medical Research, National Computational Infrastructure (NCI), QIMR Berghofer Institute of Medical Research, University of Melbourne, and University of Sydney. The project builds on years of foundational work in the Human Genomes Platform Project which also included a funded program and contracted schedules of work with partners.
This first GUARDIANS meeting helped to build a sense of shared purpose as the group established effective ways of working together across their diverse organisations. Explorations into the policies, processes, and technologies that will be required during the project were driven by the open exchange of ideas, and collaborative discussions on solutions to potential challenges.
Scientific Product Manager at ELIXIR, Dr Melissa Konopko, travelled from the UK to share her insights into the Genomic Data Infrastructure (GDI). There were many parallels to learn from this project which is enabling access to genomic and related phenotypic and clinical data across Europe through establishing a federated, sustainable and secure infrastructure to access the data.
Contact us if you’d like to learn more.
Productive connections in the global Nextflow community forged by co-working
A group of Nextflow enthusiasts will come together in Sydney to take part in the global nf-core hackathon in March. By hosting the first Australian satellite site, we hope to unlock new opportunities for both national and international collaboration.
A group of Nextflow enthusiasts will come together in Sydney to take part in the global nf-core hackathon in March. By hosting the first Australian satellite site, we hope to unlock new opportunities for both national and international collaboration.
The global nf-core community curates a set of open‑source analysis pipelines built using Nextflow and they connect regularly through in person and online events. For the first time, the international hackathon’s network of live sites is extending to Australia, with the Australian outpost of the 2025 nf-core hackathon facilitating participation during our waking hours across 24-26 Mar 2025. The group will work on a variety of projects including development of training materials and nf-core workflows alongside our five local Nextflow Ambassadors and their international colleagues.
BioCommons’ Product Manager, Bioinformatics Platforms, Dr Ziad Al Bkhetan is a Nextflow Ambassador who has experienced the value of forging strong relationships in the global Nexflow community. In his work leading the Australian Nextflow Seqera Service, Ziad identified that Proteinfold, an existing nf-core pipeline for Protein 3D Structure prediction, had great potential for Australian researchers.
Ziad identified several ways to improve Proteinfold and connected with the original developers at the Centre for Genomic Regulation (CRG) in Spain, to reconfigure the workflow, add new features, and contribute the updates back to nf-core. Ziad’s demonstration of the new tool at the last Nextflow Summit created excitement in the Nextflow community. Seqera, the team behind Nexflow, was delighted that their community-driven approach is resulting in new tools being developed by people at the research interface. CRG was pleased to see that their contribution to the community facilitated a new working relationship between their Spanish biomedical research institute and Australian BioCommons. Ziad and the team from CRG will continue to work on their workflow at the nf-core hackathon, and welcome contributions from any other community members.
We’re looking forward to connecting with the Seqera team and international peers at the nf-core hackathon to develop workflows and training materials asynchronously. There is a growing list of projects being proposed and we can expect that the range will expand as teams around the world get to work.
Finding the right collaborator can fast track a project like nothing else, and specialist fields of research almost guarantee your perfect match will be on the other side of the world. We’ve already seen how strategic events can bring global developer communities together to forge valuable new relationships in the Nextflow community, and hope to catalyse more of these opportunities.
Let us know if you are interested in joining the nf-core hackathon: Australian Outpost.
Addressing DNA sequencing data submission challenges at global scale: your invitation to learn from the experts
A range of special events for Australian researchers will focus on streamlining the data submission processes for various data types, addressing common challenges, and providing best-practice guidance on interacting with the European Nucleotide Archive (ENA).
Data submission to international repositories is an essential task for researchers, but it can often come with challenges. A range of special events for Australian researchers will focus on streamlining the data submission processes for various data types, addressing common challenges, and providing best-practice guidance on interacting with the European Nucleotide Archive (ENA).
BioCommons is partnering with specialists from the ENA and the European Molecular Biology Laboratory’s European Bioinformatics Institute (EMBL-EBI) to deliver a series of hands-on workshops and online community discussions to help Australian researchers share their valuable data. The ENA is committed to maximising the impact of the ENA platform, sequencing technology and data, and will travel to Australia to support local researchers' data submission and discuss our particular needs.
The two-week program in late March and early April will feature seven targeted workshops covering submission and retrieval workflows for specific data types (raw reads, assembled and annotated genomes, metagenome assembled genomes (MAGs) and environmental DNA (eDNA)). Three interactive online community discussions around the same themes round out the program and offer the opportunity for Australian researchers to share their experiences, challenges and potential solutions directly with international specialists from the ENA.
ENA is a globally recognised open platform for the management, sharing, integration, archiving and dissemination of nucleotide sequence data. This comprehensive resource preserves the world’s public-domain output of sequence data and is an essential tool for many researchers, but sometimes people can struggle with using it. Given that submission of omics data and associated contextual metadata to global data repositories is considered best practice and is mandated by science publishers, we want to enable researchers to facilitate the long-term preservation, findability and reusability of their data.
This is a unique opportunity to gain invaluable insights and practical skills from leading experts in omics data management and submission. Whether you're a seasoned researcher or just starting out, these workshops and discussions will empower you to confidently navigate the complexities of data sharing.
'Roundtable' Community Discussions: Challenges and solutions
01 - 03 Apr, 10:00-12:00 AEDT
Join our online 'Roundtable' Community Discussions with members of the ENA team.
Each session will focus on a different theme: Genomics - genome annotations and assemblies, and pangenomes; MAGs - submitting MAGs at scale, and how to submit MAG data when there are gaps in the NCBI species taxonomy; and eDNA - recent changes enabling eDNA sequences to be more findable through submission to ENA. Register here for roundtables.
Hands-on Training workshops: Data submission and retrieval
25 Mar - 03 Apr, 13:00-16:00 AEDT
Learn directly from the ENA team in a range of practical online training workshops to explain genome assembly and annotation, MAG and eDNA data submission and retrieval.
Our community consultations teased out some of the challenges faced by Australian-based researchers in the data submission process, and BioCommons published a set of recommendations to address them. One ambition of a range of ongoing activities around interfacing with international omics data repositories is to facilitate improvements by connecting the right people.
The European Nucleotide Archive (ENA) is a Global Core Biodata Resource, an ELIXIR Core Data Resource and is hosted by EMBL-EBI. It is one of the three nodes of the International Nucleotide Sequence Database Collaboration (INSDC).
INSDC is actively seeking additional members to enhance Global Participation.
Pawsey enables more flexible and scalable DNA analysis
Researchers have developed a novel solution to process nanopore sequencing data on any mainstream GPU hardware, expanding computational options for the global bioinformatics community. Slorado is the first open-source software/library solution for nanopore sequencing basecalling on AMD Graphic Processing Units (GPUs). It was developed using Australia’s most powerful supercomputer at the Pawsey Supercomputing Research Centre with the support of the Australian BioCommons Leadership Share (ABLeS).
This story about researchers accessing high performance computing via the Australian BioCommons Leadership Share (ABLeS) was first published by the Pawsey Supercomputing Research Centre.
Dr Hasindu Gamaarachchi and Bonson Wong, the researchers behind Slorado, the new open source software for nanopore sequencing basecalling on AMD GPUs. Credits UNSW Sydney
Researchers from UNSW Sydney and the Garvan Institute of Medical Research have developed Slorado, the first open-source software/library solution for nanopore sequencing basecalling on AMD Graphic Processing Units (GPUs). Using Setonix, Australia’s most powerful supercomputer at the Pawsey Supercomputing Research Centre, this development enables researchers worldwide to process nanopore sequencing data on any mainstream GPU hardware for the first time, expanding computational options for the global bioinformatics community.
Oxford Nanopore Technologies (ONT) sequencers generate time-series raw signals that need to be converted into DNA bases through a process called basecalling. While ONT’s Dorado software supports NVIDIA GPUs for this process, AMD GPUs are currently not supported.
Led by Dr Hasindu Gamaarachchi and PhD candidate Bonson Wong, the team has created a streamlined version of the industry-standard Dorado basecaller that removes previous hardware limitations while maintaining and in some cases increasing performance. This innovation allows researchers to utilise AMD GPUs, including those in Pawsey’s Setonix, Australia’s fastest and most energy-efficient research supercomputer, reducing processing times and wait periods for critical genomic research.
Pawsey CEO Mark Stickells underscored the impact of this development:
“Dr Gamaarachchi’s work highlights our mission to accelerate scientific discovery. Fast, efficient bioinformatics is key to addressing challenges in medicine and biotechnology, from genomic analysis to drug discovery and personalised medicine. By enabling nanopore sequencing on Setonix’s AMD GPUs, we are breaking down computational barriers and empowering researchers with world-class tools,” Mark said.
“This collaboration demonstrates how state-of-the-art infrastructure can transform research workflows and broaden access to advanced computational tools.” He continued.
Dr Gamaarachchi and Bonson Wong shared their vision for Slorado:
”Our goal was to expand computational options for the bioinformatics community. By enabling basecalling on AMD GPUs and making the software fully open-source, we are empowering researchers to process their data more efficiently using both AMD and NVIDIA hardware.”
Slorado supports up to eight GPUs simultaneously on the same node, delivering unprecedented processing capabilities when combined with systems like Setonix. Researchers can now achieve faster results without enduring lengthy queue times, accelerating the pace of critical genomic discoveries.
The team access to Setonix has been supported via the Australian BioCommons Leadership Share (ABLeS).
To know more about Slorado, register for an online event on 18 Mar 2025:
Accelerating Nanopore Analysis: Introducing the new Open Source Slorado.
Interstate students converge in Melbourne to enhance their bioinformatics
The participation of 18 students from around Australia was supported by travel grants to AMSI BioInfoSummer. The annual conference is a valuable opportunity to explore current bioinformatics research and developments, with the most recent meeting hosted by the University of Melbourne.
AMSI BioInfoSummer offers a valuable opportunity each year to explore current bioinformatics research and developments. For the most recent conference, BioCommons helped make the experience more accessible to 18 students from around Australia by sponsoring travel grants.
Organised by the Australian Mathematical Sciences Institute, the conference was hosted by the ARC Centre of Excellence for the Mathematical Analysis of Cellular Systems (MACSYS) and Melbourne Integrative Genomics (MIG) at the University of Melbourne.
As part of AMSI BioInfoSummer 2024’s well-rounded program, the ‘Emerging technologies’ theme included a ‘Hello Nextflow!’ workshop, presented by Australian BioCommons and our Sydney Informatics Hub Node at the University of Sydney.
Learn more about the outstanding recipients of our travel grant.
New bioinformatics pipeline for the identification and detection of viral genomes from Oxford Nanopore sequencing
Transformative genomic technologies can now be leveraged for detecting pathogens thanks to the innovative bioinformatics pipelines developed by bioinformatics experts in Queensland. Their work is helping protect Australian plant industries by making the new capabilities accessible to biosecurity sites nation-wide, and making the pipelines and technical instructions accessible to all researchers and diagnosticians.
An example output plot generated by the pipeline
Pathogenic viruses and viroids infecting plants can result in significant economic and ecological losses. In recent years, advances in Oxford Nanopore Technologies (ONT) have enabled real-time sequencing of long sequences through portable sequencers and thus offered a new avenue for plant virus diagnostics and research. A team from Queensland University of Technology (QUT) led by Assoc Prof Roberto Barrero and the Queensland Cyber Infrastructure Foundation (QCIF), have developed a new bioinformatics pipeline that facilitates the detection of viruses and viroids from both amplicon and metagenomic ONT data using different analytical strategies.
The pipeline called ONTViSc (ONT-based viral screening) can: 1) perform a direct search on the sequenced reads, 2) generate clusters, 3) assemble the reads to generate longer contigs or 4) directly map reads to a known reference. The pipeline is written in Nextflow, ensuring that it is reproducible, scalable, and can be run on a diverse set of compute infrastructures. It was written by Dr Maely Gauthier and tested by Dr Magda Antczak.
The workflow was developed in consultation with both researchers and diagnosticians at the Australian Government’s Department of Agriculture, Fisheries and Forestry (DAFF) to understand how it would best cater to their needs based on their current and projected internal usage of ONT, as they transition some of their processes to the new technology. The pipeline has been validated with edge samples and helped resolve the complete genome of novel plant viruses.
The ONTViSc pipeline is now available for anyone to deploy or adapt through QUT’s GitHub repository, and the how-to guide is available within the Australian BioCommons How-to Hub.
Magda utilised the Australian Nextflow Seqera Service to develop, optimise and run the ONTViSc pipeline, and spoke about the benefits of this fully subsidised service in December at the Australian Bioinformatics and Computational Biology Society (ABACBS) National Seminar Series.
The completion of this pipeline development has led to an independent nation-wide project aimed at transforming biosecurity diagnostics by making standardised bioinformatics workflows and interactive reports accessible to all DAFF biosecurity sites. The project, DNA to Decisions: Concise communication of genomics data for easier decision making is supported by a partnership between Australian BioCommons (as part of Workflow Commons), QCIF, QUT and DAFF.
Boosting genomic medicine in Sri Lanka
A workshop in Sri Lanka taught medical doctors, research lab workers and university students to use Galaxy Australia for genomic medicine.
A group of Sri Lankan medical doctors, research lab workers and university students have just completed a hands-on bioinformatics training program using Galaxy to learn about genomics. Galaxy Australia team member, Dr Nuwan Goonasekera, joined an international panel of experts to deliver the workshop Bioinformatics for Genomic Medicine.
The US Embassy in Sri Lanka and Genelabs Medical offered the three week course to empower Sri Lankan genomics researchers to independently process and analyse DNA and RNA sequencing data. Participants gained the skills needed to identify clinically actionable mutations, helping to guide treatment plans based on patient data.
Dr Shurjo Sen from the NIH National Human Genome Research Institute (NHGRI) Office of Genomics Data Science requested Nuwan teach the 28 participants to use Galaxy in collaboration with the University of Colombo. The Galaxy Australia service underpinned the training while trainees were learning how to use Galaxy for sequence analysis and Nuwan made good use of Galaxy Australia’s Training Infrastructure as a Service (TIaaS). They also accessed the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) to learn cloud computing and variant interpretation techniques. Use of this cloud-based genomic data sharing and analysis platform, along with Galaxy, will empower these Sri Lankan genomics researchers to access and analyse large genomic datasets in a collective and collaborative way.
Researchers come together to tackle the bioinformatics of fungal genomics
A diverse group representing commercial enterprise, academia, government research and citizen scientists gathered at ANU for a hands-on fungi bioinformatics workshop. The skills uplift was designed to support Bioplatforms Australia’s Functional Fungi and Plant Pathogen Omics National Initiatives.
Bioplatforms Australia brought together members of the Functional Fungi and Plant Pathogen Omics National Initiatives for a hands-on fungi bioinformatics workshop. Recognising that high throughput sequencing is the new normal for fungi research, hosts Prof Benjamin Schwessinger (Australian National University) and Dr Alistair McTaggart (Psymbiotika Lab) guided 38 researchers through theory sessions, hands-on practice, and real examples of applying bioinformatics to fungal research. Attendees were of diverse backgrounds in commercial enterprise, academia, government research and citizen science.
Each theory session was followed by a hands-on demonstration using Galaxy Australia. For example, Dr Mareike Möller (ANU) led a theory session on fungal genome assembly covering key concepts such as why long reads are essential for assembling complete genomes, and why genome assembly is like tackling the world’s hardest jigsaw puzzle. This was followed by Dr Anna Syme’s demonstration of genome assembly on Galaxy Australia. Anna wrote six dedicated workflows for fungi, which are now publicly available for any Australian to reuse or adapt on Galaxy Australia. To access them, head to Public Workflows and filter by tag ‘fungi’. If you’re just getting started with workflows, watch Anna’s Top Tips for using Galaxy workflows.
Benjamin and Alistair ensured the workshop was directly relevant to real-world fungal and plant pathogen research. All demonstrations used fungi data shared by Alistair, plus Dr Tara Garrad and Dr Kelly Hill from the South Australian Research and Development Institute, which remains available in the Bioplatforms Australia Data Portal.
Workshop participants felt that the mix of hands-on and theory sessions was ideal; demystifying the technology and bringing clarity to the broad array of options available in Galaxy Australia. Workshop materials and metadata are available in the Australian BioCommons Training Materials Zenodo repository.