Centre of Excellence for Biomolecular Research
- Website: https://bioexcel.eu
- Started: November 1, 2015
- Duration: 3+3.5 years
- Project reference: https://doi.org/10.3030/823830 https://doi.org/10.3030/675728
BioExcel is a Centre of Excellence for Biomolecular Research, initially funded for three years by the EC Horizon 2020 program for e-Infrastructure, and refunded for another three years in BioExcel-2. (BioExcel-3 is funded from 2023, but without The University of Manchester participating, as the funding programme moved to EuroHPC which the UK is not participating in).
The centre facilitate the use of high-performance computing (HPC) and high-throughput computing (HTC) in biomolecular research, both in academia and industry.
The BioExcel project aims to improve the performance, efficiency, scalability and accessibility of software for biomolecular modelling (in particular GROMACS, HADDOCK, CP2K, PMX), allowing such tools to be scaled up to run on larger supercomputer systems, while also making them easier for biomolecular researchers to use on computational infrastructures by providing training and guidance on best practices and integration with commonly used workflow systems such as CWL, KNIME, Galaxy, Apache Taverna or COMPSs.
As a Centre of Excellence, BioExcel gathers biomolecular consultants with experience in both HPC/HTC and the life sciences, who can assist both academic and industrial researchers with using and improving tools for computational biomolecular research.
eScience Lab contributions
In the BioExcel-2 project we were involved in:
- WP2: Convergence of HPC/HPDA and Improved Usability (deputy lead)
- T2.1: Application building blocks for computational biomolecular simulations
- T2.2: Definition, development and specification of workflow prototypes and demonstrators
- T2.4: Convergence of High Performance Computing (HPC) and High-Performance Data Analytics (HPDA)
- T2.5: Provisioning and maintaining a Workflow Environment
- T2.6: Retaining usability, interoperability and reproducibility in Exascale workflows (task lead)
- WP3: Community, Support and Use Cases
- T3.1: Use Case Support
- T3.2: Community Participation
- T3.3: In-Depth Support
- T3.4: Standards and Best Practice (task lead)
- T3.5: User & Community Needs Analysis
- T3.6: Challenge Participation and Support
- WP4: Dissemination and Training
- T4.1: Evaluate and test virtual learning environment
- T4.2: Promotion and outreach
- T4.3: User training
- T4.4: Capacity building and knowledge transfer
- T4.5: Collaboration
We built on our extensive experience and engagement with Common Workflow Language (CWL) and Research Objects and strengthened links between BioExcel communities and related projects like ELIXIR, EOSC and BioConda.
We developed and maintain the CWL Viewer with support from BioExcel, now maintained by Curii. We contributed to the BioExcel Building Blocks (BioBB) library, translating workflows into CWL and assisting with the development of their tutorial material and containers for Docker and Conda. We are also assisting with registering BioBB workflows on WorkflowHub.
See also full list of BioExcel deliverables.
- D1.5 – Long-term hardware and application roadmap
- D1.6 – Final project release
- Fourth project release of BioExcel core applications
- D1.7 – Update of whitepaper on scientific software development
- BioExcel Whitepaper on Scientific Software Development
- D2.1 – State-of-the-art and Initial Roadmap
- D2.2 – First release of workflow-ready building blocks library
- D2.3 – First release of demonstration workflows
- D2.4 – Development of a framework for the combination of HPC and HPDA operations
- D2.5 – Provision of a Workflow Environment at BioExcel portal
- D2.6 – Integration of BioExcel software framework with standards and international initiatives
- D2.7 – Final release of demonstration workflows
- Published in WorkflowHub
- D3.5 – Best Practice Guides
- D6.3 – High Performance Computing in support of COVID-19 Research
- D1.2 – First project release of all pilot applications
- D1.5 – Final project release of pilot applications
- D2.1 – State of the art and gap analysis
- D2.2 – First Release of Workflow Blocks and Portals
- D2.3 – User Feedback and Future Roadmap
- D2.4 – Final Release of Workflows and Modular Tools for User Services
- D3.1 – Selection and Establishment of User Groups
- D3.2 – Consultancy Proposals
- D3.5 – Consultancy Modalities and Funding Options, Final Update
See also full list of BioExcel publications.
Stian Soiland-Reyes, Peter Sefton, Mercè Crosas, Leyla Jael Castro, Frederik Coppens, José M. Fernández, Daniel Garijo, Björn Grüning, Marco La Rosa, Simone Leo, Eoghan Ó Carragáin, Marc Portier, Ana Trisovic, RO-Crate Community, Paul Groth, Carole Goble (2022):
Packaging research artefacts with RO-Crate.
Data Science 5(2)
Michael R. Crusoe, Sanne Abeln, Alexandru Iosup, Peter Amstutz, John Chilton, Nebojša Tijanić, Hervé Ménager, Stian Soiland-Reyes, Bogdan Gavrilović, Carole Goble, The CWL Community (2022):
Methods Included: Standardizing Computational Reuse and Portability with the Common Workflow Language.
Communications of the ACM 65(6)
Paul Brack, Peter Crowther, Stian Soiland-Reyes, Stuart Owen, Douglas Lowe, Alan R Williams, Quentin Groom, Mathias Dillen, Frederik Coppens, Björn Grüning, Ignacio Eguinoa, Philip Ewels, Carole Goble (2022):
Ten Simple Rules for making a software tool workflow-ready.
PLOS Computational Biology 18(3):e1009823
Stian Soiland-Reyes, Genís Bayarri, Pau Andrio, Robin Long, Douglas Lowe, Ania Niewielska, Adam Hospital, Paul Groth (2022):
Making Canonical Workflow Building Blocks interoperable across workflow languages.
Data Intelligence 4(2)
Genís Bayarri, Pau Andrio, Adam Hospital, Modesto Orozco, Josep Lluís Gelpí (2022):
BioExcel Building Blocks REST API (BioBB REST API), programmatic access to interoperable biomolecular simulation tools.
Rosa M. Badia, Javier Conejero, Jorge Ejarque, Daniele Lezzi, Francesc Lordan (2022):
PyCOMPSs as an instrument for Translational Computer Science.
Computing in Science and Engineering 24(2)
Genís Bayarri, Pau Andrio, Adam Hospital, Modesto Orozco, Josep Lluís Gelpí (2022):
BioExcel Building Blocks Workflows (BioBB-Wfs), an integrated web-based platform for biomolecular simulations.
Nucleic Acids Research 50(W1) https://doi.org/10.1093/nar/gkac380
Carole Goble, Sarah Cohen-Boulakia, Stian Soiland-Reyes, Daniel Garijo, Yolanda Gil, Michael R. Crusoe, Kristian Peters, Daniel Schober (2020):
FAIR Computational Workflows.
Data Intelligence 2(1):108–121
Farah Zaib Khan, Stian Soiland-Reyes, Richard O. Sinnott, Andrew Lonie, Carole Goble, Michael R. Crusoe (2019):
Sharing interoperable workflow provenance: A review of best practices and their practical application in CWLProv.
Pau Andrio, Adam Hospital, Javier Conejero, Luis Jordá, Marc Del Pino, Laia Codo, Stian Soiland-Reyes, Carole Goble, Daniele Lezzi, Rosa M. Badia, Modesto Orozco, Josep Ll. Gelpi (2019):
BioExcel Building Blocks, a software library for interoperable biomolecular simulation workflows.
Scientific Data 6:169
Gil Alterovitz, Dennis A Dean II, Carole Goble, Michael R Crusoe, Stian Soiland-Reyes, Amanda Bell, Anais Hayes, Anita Suresh, Charles Hadley S King IV, Dan Taylor, KanakaDurga Addepalli, Elaine Johanson, Elaine E Thompson, Eric Donaldson, Hiroki Morizono, Hsinyi Tsang, Jeet K Vora, Jeremy Goecks, Jianchao Yao, Jonas S Almeida, Jonathon Keeney, KanakaDurga Addepalli, Konstantinos Krampis, Krista Smith, Lydia Guo, Mark Walderhaug, Marco Schito, Matthew Ezewudo, Nuria Guimera, Paul Walsh, Robel Kahsay, Srikanth Gottipati, Timothy C Rodwell, Toby Bloom, Yuching Lai, Vahan Simonyan, Raja Mazumder (2018):
Enabling Precision Medicine via standard communication of NGS provenance, analysis, and results.
PLOS Biology. 16(12):e3000099
Steffen Möller, Stuart W. Prescott, Lars Wirzenius; Petter Reinholdtsen, Brad Chapman, Pjotr Prins, Stian Soiland-Reyes, Fabian Klötzl, Andrea Bagnacani, Matúš Kalaš, Andreas Tille, Michael R. Crusoe (2017):
Robust cross-platform workflows: how technical and scientific communities collaborate to develop, test and share best practices for data analysis..
Data Science and Engineering 2:232 pp 232–244.
Julie A McMurry, Nick Juty, Niklas Blomberg, Tony Burdett, Tom Conlin, Nathalie Conte, Mélanie Courtot, John Deck, Michel Dumontier, Donal K Fellows, Alejandra Gonzalez-Beltran, Philipp Gormanns, Jeffrey Grethe, Janna Hastings, Jean-Karim Hériché, Henning Hermjakob, Jon C Ison, Rafael C Jimenez, Simon Jupp, John Kunze, Camille Laibe, Nicolas Le Novère, James Malone, Maria Jesus Martin, Johanna R McEntyre, Chris Morris, Juha Muilu, Wolfgang Müller, Philippe Rocca-Serra, Susanna-Assunta Sansone, Murat Sariyar, Jacky L Snoep, Stian Soiland-Reyes, Natalie J Stanford, Neil Swainston, Nicole Washington, Alan R Williams, Sarala M Wimalaratne, Lilly M Winfree, Katherine Wolstencroft, Carole Goble, Cristopher J Mungall, Melissa A Haendel, Helen Parkinson (2017):
Identifiers for the 21st century: How to design, provision, and reuse identifiers to maximize utility and impact of life science data.
PLOS Biology 15(6):e2001414
Kyle Chard, Mike D’ Arcy, Ben Heavner, Ian Foster, Carl Kesselman, Ravi Madduri, Alexis Rodriguez, Stian Soiland-Reyes, Carole Goble, Kristi Clark, Eric W. Deutsch, Ivo Dinov, Nathan Price, Arthur Toga (2016):
I’ll Take That to Go: Big Data Bags and Minimal Identifiers for Exchange of Large, Complex Datasets.
IEEE International Conference on Big Data 2016 5 December 2016. [preprint]
https://doi.org/10.1109/BigData.2016.7840618 [pdf] [zenodo] [researchgate]