News

Subscribe to the Australian BioCommons monthly newsletter or read previous editions  

Patrick Capon Patrick Capon

The Australian Nextflow Seqera Service: Your one-stop shop for Nextflow workflows

The new service provides access to a centralised command post for Australian researchers to manage, launch and monitor their Nextflow workflows.

A new service provides access to a centralised command post for Australian researchers to manage, launch and monitor their Nextflow workflows. This subsidised access to Seqera Platform is now available through a licensing agreement between Seqera and Australian BioCommons.

Researchers can run workflows on the Australian Nextflow Seqera Service using their existing allocations at their preferred compute infrastructure, including local high-performance computers (HPCs) or commercial cloud services. Alternatively, national compute resources at Tier 1 facilities are available through the complementary Australian BioCommons Leadership Share (ABLeS). Dr Magdalena Antczak, Bioinformatician at QCIF, found this connection particularly helpful:

Thanks to the resources allocated to us by the ABLeS program, we were able to launch the ONTViSc pipeline detecting viruses from plants from within the Australian Nextflow Seqera Service. We could test the pipeline thoroughly using multiple high-performance computing infrastructures and cater for users without easy access to bioinformatics services.

Out-of-the-box configurations for running Nextflow pipelines on Pawsey Supercomputing Research Centre’s Setonix and National Computational Infrastructure’s (NCI’s) Gadi are available. Or if research groups need a more bespoke approach, extensive technical documentation and the friendly user guide will help tailor the Australian Nextflow Seqera Service to their needs.

Dr Julie Iskander, Lead Research Computing Engineer at WEHI, has seen the benefits for researchers first hand:

WEHI’s Research Computing Platform Engineering team supports WEHI researchers through software engineering services to build tools and pipelines. Seqera Platform has helped us a lot. We've been able to invite our researchers to jump in and see if the platform meets their needs. With the support of the BioCommons, we've had 15 researchers across 7 of our different research groups try it out. It's made launching pipelines easy for researchers who are not familiar with linux systems and command line. This helps them to independently run complex workflows on the HPC, with minimum knowledge of its underlying complexities.

WEHI was one of the 33 groups across 16 Australian research institutes taking part in BioCommons’ successful two-year pilot program, supported by Pawsey, NCI, Sydney Informatics Hub, Queensland Cyber Infrastructure Foundation (QCIF) and the University of Melbourne. Over that time, Seqera worked with BioCommons to understand how to best support the local Nextflow community and match Australian researchers’ sophisticated usage of the workflow management and data analysis environment, Seqera Platform.

The Australian Nextflow Seqera Service is fully subsidised for groups of up to three users to work collaboratively in a dedicated workspace. Larger organisations can explore the service at no cost by bringing an unlimited number of users to their dedicated workspaces for their first year of use (annual fee applies thereafter).

The Australian Nextflow Seqera Service is a key component of BioCommons’ vision to build an ecosystem of data analysis infrastructures that empower breakthrough discoveries.

Find out more about the Australian Nextflow Seqera Service and register to get started today!

The Australian Nextflow Seqera Service is operated by Australian BioCommons in collaboration with Pawsey, NCI, and Seqera. It is hosted on Amazon Web Services and supported by Bioplatforms Australia via NCRIS funding.

Read More
Patrick Capon Patrick Capon

Attend AMSI’s BioInfoSummer with support from Australian BioCommons

Australian BioCommons is once again supporting undergraduate and postgraduate students from around the country to travel to AMSI BioInfoSummer in Melbourne.

Australian BioCommons is once again supporting undergraduate and postgraduate students from around the country to travel to AMSI BioInfoSummer in Melbourne. 

Australian Mathematical Sciences Institute’s annual event brings together people from all disciplines to discuss the latest research and developments in bioinformatics. This flagship conference is well known for connecting those new to bioinformatics research with leading experts, and enabling cross-disciplinary networking under a shared interest in bioinformatics.

AMSI BioInfoSummer is being held at the University of Melbourne with three themes: whole-cell modelling, machine learning and AI for genomics, and emerging technologies. The emerging technologies theme includes a two-part workshop ‘Hello Nextflow!’ presented by Australian BioCommons and our Sydney Informatics Hub Node at the University of Sydney. 

Previous attendees have described BioInfoSummer as “action-packed lectures and hands-on workshops from the best in the bioinformatics field, from Australia and abroad.”

If you’re an undergraduate or postgraduate student who could benefit from full or partial travel and accommodation funding to participate in the December conference, apply before 14 October!

Read More
Patrick Capon Patrick Capon

BioCommons features in the National Digital Research Infrastructure Strategy

The National Digital Research Infrastructure Strategy outlines priority outcomes to ensure  Australian researchers continue to have access to cutting-edge research infrastructure, and features Australian BioCommons as a key case study.

The NDRI Strategy cover page shown on a laptop

The Australian Government Department of Education has published its 2030 vision for ensuring that Australian researchers maintain access to cutting-edge research infrastructure. The document features Australian BioCommons, describing how three BioCommons activities are addressing national priorities. 

The National Digital Research Infrastructure (NDRI) Strategy identified six priority outcomes to achieve the NDRI vision. By 2030, Australia’s system should be:

1. Underpinned by training frameworks for researchers and the NRI workforce.

2. Responsive to technological and societal shifts.

3. Consistent in its standards for data collection, curation, and access.

4. Integrated across levels of computing and data infrastructure.

5. Cybersecure, particularly for national-scale data and computing.

6. Maximised by openly available research software tools. 

The Strategy highlights how Australian BioCommons, in our role as the bioinformatics capability of Bioplatforms Australia, are taking up the challenge of training researchers and the NDRI workforce. It notes the high level of engagement in our training program and links this to the recruitment of NDRI users. Looking forward, the NDRI strategy calls for ‘the NDRI system [to be] underpinned by training frameworks for researchers and the NRI workforce,’ citing the importance of existing activities such as DreSA, another activity that BioCommons has proudly supported.

The Australian AlphaFold Service was highlighted as a notable example for the second priority outcome that ‘the NDRI system should be responsive to technological and societal shifts.’ The service represents a national infrastructure level response to the technological shift caused by the rise of Google DeepMind’s AlphaFold technology. AlphaFold is already accelerating research fields that rely on protein structures, such as drug discovery, vaccine design or resilient crop development, with predicted structures available in just minutes compared with existing slow, laborious experimental techniques. The Australian AlphaFold Service takes care of the set-up and provisioning of underlying infrastructure so researchers can focus on rapidly generating their protein 3D structures through AlphaFold. The service is delivered by BioCommons in partnership with the Queensland Cyber Infrastructure Foundation and the University of Melbourne.

A collaboration between Australian BioCommons and the Australian Access Federation was highlighted in the Strategy as a key example of ensuring that Australia’s NDRI remains cybersecure. The collaboration delivered a candidate solutions report to guide approaches to federated identity and access management in both the Human Genomes Platform Project, and for future research infrastructures to apply. Work in this space is ongoing, with access management solutions being deployed across the BioCommons Human Genome Informatics program.

An independent working group has been formed and will now take community input to develop investment plans, which are expected to be released in 2025.

Read the full National Digital Research Infrastructure Strategy on the Department of Education website.

Australian BioCommons is enabled by Bioplatforms Australia via National Collaborative Research Infrastructure Strategy (NCRIS) funding.

Read More
Patrick Capon Patrick Capon

Advancing the Nextflow conversation: Connect with Seqera’s Lead Developer Advocate in Melbourne

Dr Geraldine Van der Auwera is visiting Melbourne in September to support Australia’s activities around Nextflow and Seqera Platform and connect with users.

Flyer advertising Geraldine's visit

Dr Geraldine Van der Auwera, Lead Developer Advocate at Seqera, is visiting Melbourne in September to strengthen ties and support the growth of bioinformatics activities in Australia. She will meet with key stakeholders and deliver a public webinar to share the latest technical innovations and opportunities to engage with Nextflow and Seqera Platform (formerly Nextflow Tower).

The ongoing relationship between Australian BioCommons and Seqera is uplifting Australian researchers to access and deploy Seqera’s products, including Nextflow and Seqera Platform. Geraldine is visiting Melbourne to discuss future Nextflow-related activities with BioCommons and the Australian Nextflow Ambassadors, Dr Georgie Samaha and Dr Ziad Al Bkhetan. They want to know if an informal Australian Nextflow network would benefit life scientists and bioinformaticians. Share your thoughts by filling out a brief survey: Assessing interest in an Australian Nextflow network.

You can hear more from Geraldine when she delivers a BioCommons webinar Building the future of bioinformatics with Nextflow: Technical innovation, community engagement, and career development opportunities on 19 Sep 2024.  You can also meet with Geraldine in spare moments around her GA4GH Plenary attendance. Please email comms@biocommons.org.au if you would like to be connected.

P.S. If you’re looking to get hands-on with Nextflow, apply to join the Hello Nextflow! workshop on by 10 September. This workshop is being offered by BioCommons, Seqera and the Sydney Informatics Hub.

Read More
Patrick Capon Patrick Capon

New resources power long-running workflows at Pawsey Supercomputing Research Centre

In response to community requests, new resources supporting cutting edge bioinformatics workflows are available on Pawsey’s Setonix supercomputer.

The Setonix supercomputer

Pawsey’s Setonix supercomputer (supplied by Karina Nunez).

Specialised nodes are now available at the Pawsey Supercomputing Research Centre that are designed to power long-running scientific workflows. Responding to researcher demand, new Workflow Nodes have been custom built on Setonix to optimise and support workflows managed by tools like Nextflow and Snakemake that surpass the regular 96 hour wall-time constraint.

Researchers voiced their challenges in running long workflows, including numerous reports from the BioCommons computational workflows community that they were running out of wall-time - the clock time it takes for a computation to run from start to finish. One of these researchers was Lauren Huet, Bioinformatics Research Officer at the Minderoo OceanOmics Centre at UWA:

Our Ocean Genomes project is addressing a key gap where over 95% of marine vertebrates lack sequenced genomes. Building such a comprehensive reference genome library requires intensive compute power, and the workflows can be quite long. This project would not be possible without the capacity to scale up to process tens or hundreds of genomes in parallel.

Dr Sarah Beecroft, Life Sciences Supercomputing Specialist at Pawsey, led the team effort to build dedicated Workflow Nodes on Pawsey’s Setonix - the most powerful research computer in the Southern Hemisphere. 

Setonix’s Workflow Nodes provide a stable and robust environment for workflow orchestration. Users can launch their master jobs interactively and keep their sessions alive for extended time periods, enhancing both productivity and performance. I’m really excited to see the new research that is enabled!

Lauren and the OceanOmics team are already benefiting greatly from the Workflow Nodes:

It’s been a game-changer for our research! The nodes enable us to run Nextflow pipelines directly in the terminal, offering unparalleled flexibility for developing and testing our workflows. The capability to execute long-running pipelines without interruptions has significantly increased our throughput, allowing us to produce results faster and more efficiently.

As a member of the BioCommons BioCLI project, Sarah is passionate about making command-line infrastructure accessible and well documented. Together with other supercomputing experts, the team has produced a new comprehensive technical user guide for users looking to run their workflows on the Setonix Workflow Nodes.

Learn how to run workflows on the Workflow Nodes in Pawsey’s user support documentation, or join the next meeting of the BioCommons computational workflows interest group to influence future research infrastructure developments. 

Read More
Patrick Capon Patrick Capon

Supercomputing access powers paediatric research

What do the human respiratory virome and mediterranean diets have in common?

They’re both research programs at The Kids Research Institute that are being supported by the Australian BioCommons Leadership Share.

The Kids logo and Dr Patricia Agudelo-Romero

Dr Patricia Agudelo-Romero presents a poster at the 2024 AAAI conference (supplied).

Demand for high performance supercomputing resources among life scientists is increasing thanks to consistent growth in both the scale and complexity of omics datasets and analyses. The Australian BioCommons Leadership Share (ABLeS) offers a specifically tailored mix of infrastructure and computational resources to support life sciences research, providing an alternative access mechanism to Tier 1 resources outside of onerous merit-based applications. 

The Kids Research Institute Australia, formerly Telethon Kids Institute, is a great example of the support ABLeS provides to research groups. As a word-class paediatric research centre, The Kids is committed to improving children’s health across its 4 key research themes: Indigenous Health, Brain and Behaviour, Chronic and Severe Diseases, and Early Environment. Many of its programs require sophisticated computational biology tools and resources, including the P4 Respiratory Health for Kids team. The P4 team focuses on the significant healthcare burden of childhood respiratory diseases, with around 20% of Australian children developing recurrent respiratory disorders such as wheezing and asthma.

Dr Patricia Agudelo-Romero, Senior Research Fellow, leads the computational biology and bioinformatics arm of the Wal-yan Respiratory Research Centre within The Kids and is a key member of the P4 team. She uses ABLeS resources to conduct omics analyses including epigenetics, transcriptomics and metagenomics. Patricia and the P4 team recently presented two studies enabled by ABLeS - understanding the methylation landscape of in utero programming in relation to asthma risk factors (part of the AERIAL study), and exploring the complexity of the human respiratory virome. The methylation study was a featured poster at the 2024 American Academy of Allergy, Asthma & Immunology conference, while the lung virome work won best selected talk at the Microbiome Virtual International Forum in 2022, having uncovered a high diversity of bacteriophages in the airways, which may play an important role in modulating the lung ecosystem. 

ABLeS enabled both our studies to process more than 2,300 FASTQ files from targeted high-throughput methylation sequencing and shotgun metagenomics experiments, using two methylation-related nextflow pipelines and one related to virus discovery. These large-scale and computationally demanding analyses would not be possible without cutting-edge resources like our access to the Pawsey Supercomputing Research Centre provided through ABLeS.

In alignment with the open-science principles of ABLeS, Patricia has made her nextflow pipelines publicly available through the nf-core community - namely the EVEREST for viral assembly and characterisation, and target-methylseq-qc which performs downstream analyses after running a standardised nf-core methylseq pipeline. The same nf-core pipeline is being applied in another project at The Kids Institute, where the Clinical Epigenetics team are analysing whether a mediterranean diet induces DNA methylation changes in pregnant women as part of the ORIGINS study. ABLeS is enabling the team to run the methyl-seq pipeline, including ensuring the pipeline can be run on the upcoming Australian Nextflow Seqera Service.

Could your research team benefit from what ABLeS offers? Watch Dr Ziad Al Bkhetan give an overview of the service.

Read More
Patrick Capon Patrick Capon

Video tutorial simplifies sharing of bioinformatics tools

Do you write or maintain bioinformatics tools? Make them accessible to the extensive Galaxy user base by following along with the new Getting Tools into Galaxy videos.

Thousands of bioinformatics tools from across the entire omics spectrum are available within Galaxy’s user-friendly web interface. New guidance videos developed by the Galaxy Australia team are supporting anyone with a little bit of coding know-how to add their favourite tools into the global Galaxy platform.

With over 11,000 individual users accessing Galaxy every month, there are frequent requests to add new tools that cater to an ever expanding array of research needs, and there is often a backlog of tools waiting to be ‘wrapped’ for use in Galaxy. The vibrant community of contributors who maintain Galaxy are passionate about open source and accessible science, and invite all tool developers, researchers, and research communities to add bioinformatics tools to Galaxy. 

Galaxy Australia’s Dr Cameron Hyde and Michael Thang have prepared two new videos that explain and demonstrate the tool wrapping process to help anyone prepare tools for  inclusion in Galaxy workflows across the globe.

In part one, Michael introduces the Galaxy platform, explains the process of wrapping tools for Galaxy and describes the tool parameters that can be incorporated into the underlying code (XML). Then in part two, Cameron steps through the process for building an XML wrapper to add a tool to Galaxy using Planemo. To get the most out of the tutorial, you’ll need to have a basic understanding of Linux command line (Bash) and XML syntax, and be comfortable working with a code editor. These videos complement the extensive documentation available in Planemo and the Galaxy Training Network.

If there is a tool missing from your Galaxy workflow, or you’d like to make your own tools available to a global audience, start wrapping with the Getting tools into Galaxy videos!

Read More
Patrick Capon Patrick Capon

Bioinformatics innovations helping keep Australia safe from plant-borne disease

The Australian Government’s Department of Agriculture, Fisheries and Forestry is at the frontline of ensuring the safety of plant materials that are commercially imported. With increasing demand for high throughput screening, they leveraged Galaxy Australia to help ensure new diseases don't slip through.

The DAFF science and surveillance team standing in a group

The DAFF science and surveillance team (provided).

If you’ve ever watched Border Security, you’ll know that Australia’s unique ecosystem is fiercely protected from incoming plant diseases. The Australian Government’s Department of Agriculture, Fisheries and Forestry (DAFF) is at the frontline of ensuring the safety of plant materials that are commercially imported. With increasing demand for high throughput screening, they leveraged Galaxy Australia to help ensure new diseases don't slip through.

At DAFF’s post entry quarantine facility in Mickleham, viruses and viroids make up two-thirds of all diseases screened for. With import quantities growing, the demand for viral screening was placing an unmanageable workload on DAFF staff using conventional screening methods. Now, a long-standing collaboration between DAFF and A/Prof Roberto Barrero at QUT has produced an innovative bioinformatics workflow for viral screening called VirReport. It employs small RNA sequencing and can be readily deployed on Galaxy Australia and other compute infrastructures. 
As part of the collaboration, the Galaxy Australia team ensured all the necessary tools were available to support the development of a series of virus reporting workflows. DAFF’s Operational Technology team led by Callum Tyle automated the entire process in Galaxy using the public Galaxy API and the BioBlend python library from data upload, job scheduling and monitoring, through to data download. Dr Ruvini Lelwala, Bioinformatician - Operational Science and Surveillance (Post Entry Quarantine Facility) - in DAFF’s Science and Surveillance Group, then got busy training molecular biologists and plant pathologists to use VirReport, to the extent that anyone with training can now easily trigger the screening process. Ruvini speaks highly of the experience working with Galaxy:

The robust and well documented Galaxy APIs have allowed for a tailored experience to be provided to our staff within a dedicated web interface, allowing staff to schedule analyses and view results. The workflows are now routinely used by our bioinformatician and other staff at DAFF.

VirReport has become the primary screening method used by DAFF for imported Prunus (stone fruit), Rubus (brambles), Fragaria (strawberries) and clonal grasses (e.g. Zoysia, Stenotaphrum). Since deployment, 693 samples have been processed through the GA-VirReport workflow. Ruvini highlighted several Galaxy Australia features that the team enjoy:

Recent changes to searching and viewing histories have streamlined our data movement between histories, and the ability to conditionally skip workflow steps ensures continuation of independent processes within a workflow. We’ve also noticed an expanding suite of tools for statistical evaluation and data visualisation, which we continue to explore.

Learn more about the implementation of the Galaxy Australia viral screening workflow in the team’s publication: Implementation of GA-VirReport, a Web-Based Bioinformatics Toolkit for Post-Entry Quarantine Screening of Virus and Viroids in Plants.

Read More
Patrick Capon Patrick Capon

Learn new biological data analysis skills at this free online global training event

The Galaxy Training Academy 2024 is a week-long, completely free, global online event will help you master the Galaxy platform for data analysis.

Galaxy Training Academy logo

The Galaxy Training Network is thrilled to bring you Galaxy Training Academy 2024. This week-long, completely free, global online event will help you master the Galaxy platform for data analysis. This web-based platform is widely used by researchers world wide and gives you access to 1000’s of popular tools for analysis and processing of biological data.

What you learn is entirely up to you - there are learning tracks dedicated to bacterial genomics, genome assembly, machine learning, microbiome analysis, single cell omics, transcriptomics, and viral genome sequencing. Choose what’s of interest, learn at your own pace, and get online support 24/7 through dedicated Slack channels. The Galaxy Australia team will be on hand to answer on Slack when everyone else is asleep!

Don’t miss this opportunity to advance your skills in bioinformatics and data analysis!

When: 7 - 11 October, 2024

Where: Online

Format: Asynchronous, choose-your-own-adventure, video tutorials with online support

Price: Free

More information and registrations via the Galaxy Academy 2024 website.  

Registrations close on 30 September 2024

Read More
Patrick Capon Patrick Capon

Australia’s new Nextflow Ambassadors connect users to global community

Meet the Australians championing Nextflow as a standard for defining bioinformatics workflows.

Nextflow has emerged as a standard for defining bioinformatics workflows. Australian BioCommons is responding to increased use of Nextflow in the life sciences by convening the Bioinformatics Workflows community, offering Nextflow training, and standing up an Australian Nextflow Seqera Service. Now, Australia has two Nextflow Ambassadors to support the researcher community.

Dr Georgie Samaha and Dr Ziad Al Bkhetan have been awarded positions in the Nextflow Ambassador Program to ensure Australian users are represented on the global stage. The program fosters collaboration, knowledge sharing, and community growth. 

Georgie is the bioinformatics group lead of the Sydney Informatics Hub at the University of Sydney and is passionate about making bioinformatics more accessible for researchers. She works with BioCommons to develop public digital infrastructure and leads the BioCommons BioCLI project.

I’m excited to help bridge the gap between life scientists using Nextflow and technical experts. As ambassadors we can understand their needs and voice them directly to Seqera

Ziad is Product Manager, Bioinformatics Platforms at Australian BioCommons and enjoys uplifting life scientists to achieve their research outcomes in two BioCommons services:  ABLeS and the Australian Nextflow Seqera Service

I’m looking forward to acting as a local contact for Nextflow users, and helping to build a national Nextflow capability that can contribute best practice workflows to the international community.

Dr Geraldine Van der Auwera, Lead Developer Advocate at Seqera (the company behind Nextflow), is looking forward to meeting the ambassadors in person when she visits Australia in September:

We're thrilled to have Georgie and Ziad join the Nextflow Ambassadors! We're keenly aware of the distance and time-related challenges that exist between Australia and other Nextflow hubs, so we expect that having not one, but two ambassadors available to share their expertise with their local networks will make a big difference! We’re hopeful that their localised support, training, and knowledge sharing will ensure that researchers and institutions across Australia can utilise Nextflow effectively in their work.

Both ambassadors have hit the ground running, with Ziad presenting at the upcoming Nextflow Summit Barcelona 2024, and Georgie playing a key role in organising national training events like the upcoming Hello Nextflow! workshop.

Georgie and Ziad are assessing interest in the formation of an informal Australian Nextflow network that would benefit life scientists, and they want to hear from you! If you use Nextflow, or are planning to in the near future, share your thoughts by filling out this brief survey.

Read More