Public Dataset Benchmarks

This page outlines the plans and the instructions for the public dataset portion of the 2024 OpenFE Industry Benchmark study.

Overview

The Public Benchmarking Study concentrates on validating the use of the OpenFE Toolkit on publicly available protein-ligand binding datasets. Specifically we concentrate on re-calculating the Schrodinger public binding free energy benchmark set from the 2023 large scale study by Ross et al..

Figure 1: Overview of Public Dataset Benchmark process.

As outlined in Figure 1, the public benchmark will be split into four phases:

Remediation and validation of input structures from the Schrodinger public binding free energy benchmark set.
Simulating relevant free energy perturbations on each participating industry member’s own HPC resources.
Extracting and gathering results.
Processing results into a manuscript for submission to an appropriate journal.

At each stage of the process, the OpenFE team will provide necessary inputs and support to all participating members.

Before you start

Warning

Please do not run benchmarks with pre-existing installations of OpenFE. You should look at our specific benchmark openfe install instructions for how to set up openfe.

Before starting you should install openfe on any local machine you will use to prepare the benchmark systems and any HPC systems you will be running simulations on.

Please note that a very specific install of openfe and its dependencies is required in order to reduce any variance introduced by new software releases throughout the length of the study.

Phase 1: Preparing Inputs

In this phase, benchmark input structures will be prepared by industry partners for use with the OpenFE toolkit.

Start date: Mid June 2024

End date: Mid July 2024

Allocation Benchmark Systems

Each participating industry partner will be expected to select one or more system for benchmarking.

At a very minimum, we aim to simulate these entirety of the following three datasets:

JACS
Fragments
Merck

Please get in touch with the OpenFE team should you wish to select any new systems or if you wish to be reminded of which systems you have been allocated.

Input Data Source

Input structures, perturbation networks, and FEP+ results are taken from the v2.0 release of the Schrodinger 2023 Public Benchmark set.

For convenience, a snapshot of these inputs has been provided in the OpenFE 2024 benchmark repository.

Remediation of inputs

Input structures will need to be adapted in order to be used with the OpenFE toolkit.

Nominally input remediation will involve:

Extracting any cofactors from input PDB files
Fixing capping groups on terminal residues
Mutating any non-canonical amino acids
Addressing any issues with residue and atom names
Removing alternate ligand conformational or protonation states

Depending on the software used to prepare the inputs, the resulting files may need to be stripped of extraneous or sensitive metadata.

A list of instructions and conditions for preparing inputs are provided here. Updates will be made to these instructions based on feedback by benchmark partners.

Depositing inputs

All remediated inputs will be deposited in the OpenFE 2024 benchmark repository. Doing this will involve a lightweight review process by the OpenFE team, which will allow them to:

Gather preparation conditions to be included in relevant publications
Check-in with industry partners and gather feedback on the input preparation experience
Allow for the OpenFE team to help with any unanticipated issues

Please see the contributing inputs instructions for more details on this process.

Phase 2: Running Simulations

Warning

Please do not start Phase 2 until your prepared benchmark inputs have been approved by the OpenFE team.

In this phase, industry partners will run alchemical transformations for their allocated systems on their HPC resources.

Start date: Early July 2024

End date: End of August 2024

Please note that we expect the private dataset industry benchmark to start alongside this phase.

Simulation Planning: LOMAP networks

The setup of the relative binding free energy network is carried out using the planning script provided under utils/plan_rbfe_network.py.

This script will carry out the following steps:

Loading the ligands, protein, and (if provided) cofactors
Computing partial charges for ligands and cofactors using antechamber AM1BCC
Creating a network of ligand transformations using the Kartograf atom mapper, LOMAP scorer, and LOMAP network generator
Assigning settings to the transformations. Settings differ depending on whether the ligand transformation involves a change in net charge
- Non charge changing transformations: 11 lambda windows, 5 ns production run per lambda window
- Charge changing transformations: 22 lambda windows, 20 ns production run per lambda window
Creating the AlchemicalTransformations for solvent and complex legs and saving them to disc as json files

In an environment with OpenFE 1.0 installed, please run this script by calling:

# If you don’t have cofactors
python plan_rbfe_network.py --pdb protein.pdb --ligands ligands.sdf --output network_setup

# If you have cofactors
python plan_rbfe_network.py --pdb protein.pdb --ligands ligands.sdf --cofactors cofactors.sdf --output network_setup

This command will create a folder (named network_setup as specified using the --output flag) that contains a separate .json file for the solvent and complex legs for every edge in the network. The folder also contains a ligand_network.graphml file that is a serialized version of the LigandNetwork.

Warning

Since the partial charge assignment can be slow, we recommend putting the planning command in a bash script and executing it on a high performance workstation or HPC resource.

Simulation Execution

All planned simulations will be run by industry partners on their own clusters using OpenFE execution tooling, i.e. through the quickrun method. You can find additional information and examples on how to run simulations of the entire network in the “Running the simulations” section of our CLI tutorial. In this study, we will use a slightly modified approach to our CLI Tutorial, allowing for execution of a single task per HPC job.

Here is an example of a very simple script that will create and submit a separate job script (*.job named file) for every alchemical transformation (for the simplest SLURM use case):

for file in network_setup/transformations/*.json; do
  relpath="${file:30}"  # strip off "network_setup/"
  dirpath=${relpath%.*}  # strip off final ".json"
  jobpath="network_setup/transformations/${dirpath}.job"
  if [ -f "${jobpath}" ]; then
    echo "${jobpath} already exists"
    exit 1
  fi
  for repeat in {0..2}; do
    cmd="openfe quickrun ${file} -o results_${repeat}/${relpath} -d results_${repeat}/${dirpath}"
    echo -e "#!/usr/bin/env bash\n${cmd}" > "${jobpath}"
    sbatch "${jobpath}"
  done
done

Please reach out to the openfe team if you have any questions on how to adapt this script to your internal needs, we would be happy to assist with this.

Handling Failed Edges

It is possible that some of the simulations in the network fail. More specifically, not all repeats of an edge may finish successfully, or an edge can fail entirely. You should follow this strategy for dealing with those failures:

1. Non-reproducible failures

When at least one repeat of the edge completed successfully
Keep a log of the failure
Rerun the failed job(s) up to 3 times.
If the simulation repeat is still failing after 3 times, and the failure is due to NaN errors, then treat this as a reproducible failure.

2. Reproducible failures

When all repeats of an edge failed
Keep a log of the failure
Do not rerun this edge
If this is a redundant edge:
- Remove this edge from the network and carry out the analysis without this edge
If this is a non-redundant edge (meaning that removing this edge would lead to a disconnected graph):
- Add a new edge to the network. We provide a script that will automatically find a suitable new edge, more information can be found here.

Identifying Failed Edges

Failed edges can be identified in a few different ways:

Assuming you have no jobs running, there should be a 1-1 mapping between result json files and job work directories, for example:

$ ls
easy_rbfe_lig_ejm_31_complex_lig_ejm_46_complex/
easy_rbfe_lig_ejm_31_complex_lig_ejm_46_complex.json
easy_rbfe_lig_ejm_31_complex_lig_ejm_47_complex/
easy_rbfe_lig_ejm_31_complex_lig_ejm_47_complex.json
easy_rbfe_lig_ejm_31_complex_lig_ejm_48_complex/
easy_rbfe_lig_ejm_31_complex_lig_ejm_48_complex.json
easy_rbfe_lig_ejm_31_complex_lig_ejm_50_complex/
easy_rbfe_lig_ejm_31_complex_lig_ejm_50_complex.json
easy_rbfe_lig_ejm_31_solvent_lig_ejm_42_solvent/
easy_rbfe_lig_ejm_31_solvent_lig_ejm_42_solvent.json

If a directory like easy_rbfe_lig_ejm_31_solvent_lig_ejm_42_solvent/ exists but the corresponding json file doesn’t (easy_rbfe_lig_ejm_31_solvent_lig_ejm_42_solvent.json) then we know the edge failed. The directory should be removed and the job should be resubmitted (depending on the failure type as discussed above).

The extract results script mentioned here and the cleanup script mentioned here both include output of which folder(s) and json file(s) contain errors and can be removed prior to starting new jobs.

Fixing broken networks

If removing a reproducibly failing edge leads to a disconnected graph, new edges need to be added in order to fix the broken network. There are different ways to detect broken networks:

You experienced failures during the plotting script or the script for extracting results (e.g. ValueError: Computational results are not fully connected)
The number of ligands in the output dg.tsv file of the extract_results.py script is lower than the number of ligands the dataset should have

Fixing the disconnected network can be achieved using the script provided under utils/fix_networks.py.

The script finds new edges to repair the disconnected network (two edges per disconnected network) and produces the inputs necessary to run those additional edges.

Note that the script will throw an error if at least one repeat completed successfully. In that case (non-reproducible failure) we recommend re-running the failed jobs (see Handling failed edges

Here an example of how to run the script:

python fix_networks.py --input_alchem_network_file network_setup/alchemicalNetwork/alchemical_network.json --result_files results_*/*json --output_extra_transformations new_network_setup

The script takes as inputs the AlchemicalNetwork from the original setup (e.g. network_setup/alchemicalNetwork/alchemical_network.json), the result .json files (e.g. results_*/*json, and the folder name for storing the outputs (e.g. new_network_setup). This command will create a folder (named new_network_setup as specified using the --output_extra_transformations flag) that contains a transformations folder with a separate .json file for the solvent and complex legs for every new edge connecting the previously broken network. The new_network_setup folder also contains a ligand_network.graphml file as well as an alchemical_network.json file containing the serialized version of the new LigandNetwork and AlchemicalNetwork. These networks only contain the additional edges and not the full new network.

Once the inputs for the edges to fix the broken network have been created, you can submit those calculations as described in the Simulation execution section. Note that you will have to update the filepath to point to the new input .json files, e.g.

for file in new_network_setup/transformations/*.json; do
  relpath="${file:34}"  # strip off "new_network_setup/"
  dirpath=${relpath%.*}  # strip off final ".json"
  jobpath="new_network_setup/transformations/${dirpath}.job"
  if [ -f "${jobpath}" ]; then
    echo "${jobpath} already exists"
    exit 1
  fi
  for repeat in {0..2}; do
    cmd="openfe quickrun ${file} -o results_${repeat}/${relpath} -d results_${repeat}/${dirpath}"
    echo -e "#!/usr/bin/env bash\n${cmd}" > "${jobpath}"
    sbatch "${jobpath}"
  done
done

Inspecting Results

Note

A separate script is provided for gathering relevant FE output data in Phase 3.

Due to the slightly modified simulation execution layout, using openfe gather will not work in openfe v1.0.1.

Instead we recommend using the scripts we bundle in this repository.

Extracting ddG results

You can extract the ddG results in the following manner from the directory containing results_0, results_1, and results_2.

wget https://raw.githubusercontent.com/OpenFreeEnergy/IndustryBenchmarks2024/main/industry_benchmarks/utils/extras/extract_results.py
python extract_results.py --input_ligand_network_file network_setup/ligand_network.graphml

This will create a file named ddg.tsv from the individual simulation repeats.

If the simulations are complete, you can plot the ddg.tsv results against the Schrodinger results downloaded from the relevant systems’ ligand prediction file from: https://github.com/schrodinger/public_binding_free_energy_benchmark/tree/main/21_4_results/ligand_predictions

For example, for the JACS TYK2 set:

wget https://raw.githubusercontent.com/schrodinger/public_binding_free_energy_benchmark/v2.0/21_4_results/ligand_predictions/jacs_set/tyk2_out.csv
wget https://raw.githubusercontent.com/OpenFreeEnergy/IndustryBenchmarks2024/main/industry_benchmarks/utils/extras/plot_results.py
python plot_results.py --calculated ddg.tsv --experiment tyk2_out.csv

This will automatically generate plots of OpenFE vs experiment and FEP+ vs experiment using cinnabar.

Simulation Cleanup

The post-simulation cleanup script will reduce the amount of data you need to store after your simulations. It does not delete any data required for analysis. To obtain the script, download it with:

$ curl -LOJ https://raw.githubusercontent.com/OpenFreeEnergy/IndustryBenchmarks2024/main/industry_benchmarks/utils/results_cleanup.py

The cleanup script requires the path to the result.json file and also accepts a list of result.json files. Additionally, the script will detect if the result data has already been reduced and will skip processing that file. This means that you can run the script repeatedly with a glob as results are finished. For example:

$ micromamba activate openfe-benchmark  # don't forget to activate your conda environment first!
$ python results_cleanup.py path/to/results/*.json

Warning

Do not move data around before running the script. The script assumes that the result directory has not been moved.

Warning

Do not run multiple instances of the script at once using a wild card glob, i.e. *.json. The script does not use file locks to ensure multiple instances are not operating on the same file.

To save space as simulations complete, consider adding a line at the end of your job submission script that runs the cleaning script.

openfe quickrun "${file}" -o results_"${repeat}"/"${relpath}" -d results_"${repeat}"/"${dirpath}"
python results_cleanup.py results_"${repeat}"/"${relpath}"

The cleanup script will delete:

Redundant structural analysis data
Redundant input data
Simulation .nc & checkpoint .chk files

Note

The simulation .nc file is subsampled and for each lambda window, an .xtc trajectory file is created

Compute Requirements

To run the benchmark simulations following GPU hardware will be required:

An NVIDIA GPU (CUDA 10.2 or above compatible)
- In non-exclusive compute mode
A single GPU ID assigned per openfe quickrun execution
- e.g. by setting CUDA_VISIBLE_DEVICE if necessary
Estimated standard transformation compute time:
- Approximately 8-12 GPU hours per complex transformation repeat
- Approximately 1-2 GPU hours per solvent transformation repeat
Estimated net charge transformation compute time:
- Approximately 4-7 GPU days per complex transformation repeat
- Approximately 8-12 GPU hours per solvent transformation repeat

Data Storage Requirements

Warning

You will need to keep any temporary data until the post-simulation cleanup script is made available.

Temporary data:

Retention time: until the post-simulation cleanup script is applied to your data.

Estimated storage costs:

Standard simulations: 5-10 GB per triplicate repeat cycle
Net charge transformation: 40-80 GB per triplicate repeat cycle

This data contains the full simulation outputs, including large netcdf trajectories with 1 ps snapshots of the coordinates and energies. A cleanup script will be provided to extract relevant long-term data from these outputs.

Long term data:

Retention time: until the benchmark manuscript is accepted for publication. Estimated storage costs: sub 500 mb per triplicate repeat cycle.

This long term data will be extracted from above-mentioned temporary data using the post-simulation cleanup script.

It will include:

Reduced potential arrays for free energy analysis
OpenFE output JSON files
PNGs from structural analysis
XTC coordinate trajectories for each lambda window containing 21 evenly spaced frames

The data will be kept around by each partner for further processing should it be necessary as part of the manuscript writing process.

Phase 3: Data Analysis

In this phase, relevant simulation results will be gathered from industry partners.

Start date: Early November 2024

End date: Early December 2024

Industry partners are expected to post-process simulation outputs using a specialized script provided by the OpenFE team.

This script extracts:

Relevant free energy estimates, including time series of free energies
Simulation health metrics
- Overlap matrix and replica exchange probability plots
- Relevant structural analysis plots
Additional transformation information
- Additional metrics, including blinded transformation and ligand scores.

Industry partners are expected to submit this information back to OpenFE for analysis. A list of gathered data is provided to help with data review ahead of submission.

Gathering of results

Note

Make sure to run the Simulation Cleanup script before running the data gathering script! If you have not run the cleanup script, this data gathering script will throw an error.

Note

The data gathering script will throw an error if the network is broken or if any of the three repeats have not finished successfully.

If you have a broken network, please run the fix_networks script and any newly generated edges to obtain a connected network. You will then need to use the –fixed_alchemical_network option (see below) when calling the script.

If some of your repeats have failed, please re-run those as described here

Warning

By default the gathering script does not replace ligand names! For the public datasets, we want to use the same ligand names as in the input files. For the private datasets, if you used proprietary names, please use the --hide-ligand-names flag.

To perform the data gathering, do the following:

wget https://raw.githubusercontent.com/OpenFreeEnergy/IndustryBenchmarks2024/main/industry_benchmarks/utils/data_gathering.py
python data_gathering.py --input_alchemical_network network_setup/alchemical_network.json --results-folder results_0 --results-folder results_1 --results-folder results_2

If the fix network script was run, two alchemical networks need to be provided to the data gathering script, the original one and the one that was created after running the fix_networks.py:

python data_gathering.py --input_alchemical_network network_setup/alchemicalNetwork/alchemical_network.json --fixed_alchemical_network new_network_setup/alchemicalNetwork/alchemical_network.json --results-folder results_0 --results-folder results_1 --results-folder results_2

If confidential ligand names were used (private datasets), those can be replaced with generic ligand names. To do that you will need to add the flag --hide-ligand-names:

python data_gathering.py --input_alchemical_network network_setup/alchemicalNetwork/alchemical_network.json --results-folder results_0 --results-folder results_1 --results-folder results_2 --hide-ligand-names

Warning

Per default, the ligand names that were present in the original ligand sdf will be stored. Please check those and if they contain any confidential information, use the hide-ligand-names flag.

Content of the data that will be transferred to OpenFE

Here you can find the full list of files and data that will be extracted in the data gathering script.

1. Network properties JSON file

Note

All ligand properties gathered are, to best of our knowledge, blinded and cannot be used to recover the underlying structures.

A file containing a variety of network properties (all_network_properties.json). More specifically, the file contains the following information:

Edges: Following information is saved for each transformation:
- Lomap score
- Alchemical charge difference
- Charge score: A score that describes the difficulty of a transformation related to the absolute change difference between ligands
- Shape score
- Volume score
- Mapping RMSD score
- Number of heavy atoms in the common core
- Number of heavy atoms in the dummy region of ligand A
- Number of heavy atoms in the dummy region of ligand B
- Changing number of rings
- Changing number of rotatable bonds
- Morgan fingerprint Tanimoto similarity
- AtomPair fingerprint dice similarity
- Topological torsion fingerprint dice similarity
- Difference in solvent accessible surface area
- Whether or not the simulation failed
Ligands: Following information is saved for each ligand:
- Number of rotatable bonds
- Number of ring systems
- Number of heavy atoms
- Number of elements
- Solvent accessible surface area
- Formal charge
DDG_estimates: For each leg and each repeat, the MBAR DDG estimate and its uncertainty are stored

An example extract from the all_network_properties.json file is shown for the bace test system:

{
  "Ligands": {
    "spiro1": {
      "num_rotatable_bonds": 1,
      "num_rings": 4,
      "num_heavy_atoms": 26,
      "num_elements": 5,
      "solvent_accessible_surface_area": 387.43906527976293,
      "formal_charge": 1
    },
    "spiro2": {
      "num_rotatable_bonds": 1,
      "num_rings": 4,
      "num_heavy_atoms": 24,
      "num_elements": 5,
      "solvent_accessible_surface_area": 365.53534383347056,
      "formal_charge": 1
    }
  },
  "Edges": {
    "edge_spiro2_spiro1": {
      "ligand_A": "spiro2",
      "ligand_B": "spiro1",
      "lomap_score": 0.7788007830714049,
      "alchemical_charge_difference": 0,
      "charge_score": 1.0,
      "shape_score": 0.2709510437018804,
      "volume_score": 0.940014662311061,
      "mapping_rmsd_score": 0.28410096856069245,
      "num_heavy_core": 24,
      "num_heavy_dummy_A": 0,
      "num_heavy_dummy_B": 2,
      "difference_num_rings_AB": 0,
      "difference_num_rot_bonds_AB": 0,
      "morgan_tanimoto_similarity": 0.6966292134831461,
      "difference_solvent_accessible_surface_area": 21.903721446292366,
      "atom_pair_dice_similarity": 0.899527983816588,
      "topological_torsion_dice_similarity": 0.9166666666666666
    }
  },
  "DDG_estimates": {
    "solvent_spiro2_spiro1_repeat_0": [
      {
        "magnitude": 76.98924715451133,
        "unit": "kilocalorie_per_mole",
        ":is_custom:": true,
        "pint_unit_registry": "openff_units"
      },
      {
        "magnitude": 0.45136662410654116,
        "unit": "kilocalorie_per_mole",
        ":is_custom:": true,
        "pint_unit_registry": "openff_units"
      }
    ],
    "solvent_spiro2_spiro1_repeat_1": [...

2. Simulation files (reduced potentials, system information, and automated analyses)

Note

None of the files gathered contain any information which could be used to extract the underlying system’s chemistries.

For each transformation and each repeat, a folder is created that contains following files:

simulation_real_time_analysis.yaml: contains cumulative free energy estimates and simulation run time information
info.yaml: number of atoms, run time information (nanoseconds per day)
ligand_RMSD.txt: time series values for the ligand RMSD from the initial frame for each replicate
ligand_wander.txt: time series values for the ligand center of mass drift from its initial frame position for each replicate
protein_RMSD.txt: time series values for the protein RMSD from the initial frame for each replicate
protein_2D_RMSD.txt: time series values for the protein RMSD measured between every frame for each replicate
u_ln.txt: time series values for the reduced potential for each replicate
N_l.txt: the number of samples drawn from each replicate state
replicas_state_indices.txt: time series values of the replica state
time_ps.txt: the total simulation time in picoseconds at each sample
If present, following image analysis files are collected:
- forward_reverse_convergence.png: a time censured plot of the free energy estimates
- ligand_COM_drift.png: the ligands center of mass (COM) distance from its initial frame position
- ligand_RMSD.png: the RMSD of the ligand from the initial frame
- mbar_overlap_matrix.png: the MBAR overlap matrix
- protein_2D_RMSD.png: the 2D RMSD of the protein alpha carbons
- replica_exchange_matrix.png: the replica exchange matrix
- replica_state_timeseries.png: a plot of the replica state timeseries

The results directory should be organised as follows:

results_data
├── all_network_properties.json
├── complex_spiro2_spiro1_repeat_0
│   ├── ligand_RMSD.txt
│   ├── ligand_wander.txt
│   ├── protein_RMSD.txt
│   ├── protein_2D_RMSD.txt
│   ├── u_ln.txt
│   ├── N_l.txt
│   ├── replicas_state_indices.txt
│   ├── info.yaml
│   ├── simulation_real_time_analysis.yaml
│   └── time_ps.txt
├── solvent_spiro2_spiro1_repeat_0
│   ├── ligand_RMSD.txt
│   ├── ligand_wander.txt
│   ├── protein_RMSD.txt
│   ├── protein_2D_RMSD.txt
│   ├── u_ln.txt
│   ├── N_l.txt
│   ├── replicas_state_indices.txt
│   ├── info.yaml
│   ├── simulation_real_time_analysis.yaml
│   └── time_ps.txt
...

A zip archive of this data will also be created in the output folder called results_data.zip which should be uploaded to zenodo.

Uploading the results to zenodo

The results from this benchmarking study will be submitted to the OpenFE team via a zenodo upload.

To upload the data, please follow these steps:

Run the Simulation Cleanup script.
Run the data gathering script. The output of this step should be a compressed folder named results_data.zip.
Create a directory for each dataset, naming it after the dataset, e.g. jacs_bace. Then, move the results_data.zip file into the respective dataset folder. Please make sure to include both the set name (e.g. jacs or waterset) and the target name (e.g. bace) since some targets occur in multiple sets. For private datasets, please also copy the CSV file with the experimental data and the private info form into this same folder (see here for further instructions).

mkdir jacs_bace
mv results_data.zip jacs_bace
zip -r jacs_bace.zip jacs_bace

Upload the dataset zipped folder (that contains the compressed results) to zenodo.

Login to zenodo or sign up if you do not have an account.

Upload the data to the Open Free Energy Zenodo Community. You can find the OpenFE community here: https://zenodo.org/communities/openfreeenergy/records?q=&l=list&p=1&s=10&sort=newest.

Click the New upload button on the top right.

Fill out all the relevant information.

Digital Object Identifier: Click No and Generate a DOI now!

Resource type: Dataset

Title: For example something like: OpenFE 2024 Industry Benchmarking Project, Results from X (where X could be company name, or dataset name, or author.

Creators: Please add as authors everyone involved in running the calculations.

Description: Please report the OpenFE version that was used. You can get this information by running openfe --version in the commandline within the environment that you used to run the setup, simulations, and analysis of the data. In addition to that, please reference the Industry Benchmark repo as it contains all necessary information about the simulation details. E.g.: OpenFE 2024 Industry Benchmarking Project, Results from systems jacs set; bace and merck set; shp2. OpenFE version v1.0.1. Simulation details can be found in the GitHub repository https://github.com/OpenFreeEnergy/IndustryBenchmarks2024

Add the zipped dataset folder (e.g. jacs_bace.zip) that contains the compressed results folder (results_data.zip) to the zenodo upload. Please upload a separate zipped folder for each dataset.

Click the Submit for review button on the top right. We will then review the upload and publish it.

If you plan to submit the results from multiple datasets in a single zenodo upload, please make sure to create subfolders for the different datasets and to name them according to the set name and target name. Additionally, please list all the sets contained in the upload in the Description section.

Note

For the private datasets you will need to upload two additional files to zenodo, a CSV file with experimental data and the private dataset info form. You can find the instructions for uploading the private datasets here.

Analysis of results

Once submitted, the datasets will be analyzed by the OpenFE development team with help from volunteering industry board and technical advisory committee members.

Phase 4: Paper writing

Note

Details for phase 3 of the public dataset benchmarks are still being finalised. These will be updated as soon as possible!

Start date: Early November 2024

End date: December 2024

Drafting the manuscript

Once all results have been gathered, the OpenFE team alongside volunteer members of the OpenFE board and technical advisory committee will draft a manuscript for open access publication at a relevant journal.

Review of manuscript and authorship

All authors will be expected to review and approve the manuscript prior to journal submission. We anticipate doing this in a two round process, the first round where authors are invited to comment on the manuscript, followed by a second one for legal review by each partner organization.

Authorship will be offered to all those involved in the benchmarking process and inclusion will be left to the discretion of each organization.