.. _public_overview:

*************************
Public Dataset Benchmarks
*************************


This page outlines the plans and the instructions for the public dataset portion of the 2024 OpenFE Industry Benchmark study.


Overview
********


The Public Benchmarking Study concentrates on validating the use of the `OpenFE Toolkit <https://docs.openfree.energy/en/stable/>`_
on publicly available protein-ligand binding datasets. Specifically we concentrate
on re-calculating the `Schrodinger public binding free energy benchmark set <https://github.com/schrodinger/public_binding_free_energy_benchmark>`_
from the `2023 large scale study by Ross et al. <https://www.nature.com/articles/s42004-023-01019-9>`_.


.. figure:: public_overview.png
   :scale: 100%
   :align: center
   :alt: Figure 1 - Overview of Public Dataset Benchmark process.


*Figure 1: Overview of Public Dataset Benchmark process.*

As outlined in Figure 1, the public benchmark will be split into four phases:

1. Remediation and validation of input structures from the `Schrodinger public binding free energy benchmark set <https://github.com/schrodinger/public_binding_free_energy_benchmark>`_.
2. Simulating relevant free energy perturbations on each participating industry member’s own HPC resources.
3. Extracting and gathering results.
4. Processing results into a manuscript for submission to an appropriate journal.

At each stage of the process, the OpenFE team will provide necessary inputs and support to all participating members.


Before you start
****************

.. warning::
   Please do not run benchmarks with pre-existing installations of OpenFE.
   You should look at our specific :ref:`benchmark openfe install instructions <installation>`
   for how to set up openfe.


Before starting you should :ref:`install openfe <installation>` on any local machine you will use
to prepare the benchmark systems and any HPC systems you will be running simulations on.

Please note that a very specific install of `openfe` and its dependencies is required in
order to reduce any variance introduced by new software releases throughout the length
of the study.


.. _public_phase1:

Phase 1: Preparing Inputs
*************************

In this phase, benchmark input structures will be prepared by industry partners for use with the OpenFE toolkit.


**Start date:** *Mid June 2024*

**End date:** *Mid July 2024*


Allocation Benchmark Systems
============================

Each participating industry partner will be expected to select one or more system for benchmarking.

At a very minimum, we aim to simulate these entirety of the following three datasets:

* JACS
* Fragments
* Merck

Please :ref:`get in touch <get_in_touch>` with the OpenFE team should you wish to select
any new systems or if you wish to be reminded of which systems you have been allocated.


Input Data Source
=================

Input structures, perturbation networks, and FEP+ results are taken from the `v2.0 release of the Schrodinger 2023 Public Benchmark set <https://github.com/schrodinger/public_binding_free_energy_benchmark/tree/v2.0>`_.

For convenience, a `snapshot of these inputs <https://github.com/OpenFreeEnergy/IndustryBenchmarks2024/tree/main/industry_benchmarks/input_structures/original_structures>`_ has been provided in the OpenFE 2024 benchmark repository.


Remediation of inputs
=====================

Input structures will need to be adapted in order to be used with the OpenFE toolkit.

Nominally input remediation will involve:

* Extracting any cofactors from input PDB files
* Fixing capping groups on terminal residues
* Mutating any non-canonical amino acids
* Addressing any issues with residue and atom names
* Removing alternate ligand conformational or protonation states

Depending on the software used to prepare the inputs, the resulting files may need to be stripped of extraneous or sensitive metadata.

A list of instructions and conditions for preparing inputs are provided :ref:`here <input-preparation>`.
Updates will be made to these instructions based on feedback by benchmark partners.


.. toctree::
   :maxdepth: 1
   :hidden:

   input_preparation


Depositing inputs
=================

All remediated inputs will be deposited in the `OpenFE 2024 benchmark repository <https://github.com/OpenFreeEnergy/IndustryBenchmarks2024>`_.
Doing this will involve a lightweight review process by the OpenFE team, which will allow them to:

1. Gather preparation conditions to be included in relevant publications
2. Check-in with industry partners and gather feedback on the input preparation experience
3. Allow for the OpenFE team to help with any unanticipated issues


Please see the :ref:`contributing inputs instructions <contributing-inputs>` for more details on this process.


.. toctree::
   :maxdepth: 1
   :hidden:

   contributing_inputs


.. _public_phase2:

Phase 2: Running Simulations
****************************


.. warning::
   Please do not start Phase 2 until your prepared benchmark inputs have been approved by the OpenFE team.


In this phase, industry partners will run alchemical transformations for their allocated systems on their HPC resources.

**Start date:** *Early July 2024*

**End date:** *End of August 2024*

Please note that we expect the private dataset industry benchmark to start alongside this phase.

.. _simulation_planning:

Simulation Planning: LOMAP networks
===================================

The setup of the relative binding free energy network is carried out using the planning script provided under 
`utils/plan_rbfe_network.py <https://github.com/OpenFreeEnergy/IndustryBenchmarks2024/tree/main/industry_benchmarks/utils/plan_rbfe_network.py>`_.

This script will carry out the following steps:

* Loading the ligands, protein, and (if provided) cofactors
* Computing partial charges for ligands and cofactors using antechamber AM1BCC
* Creating a network of ligand transformations using the Kartograf atom mapper, LOMAP scorer, and LOMAP network generator
* Assigning settings to the transformations. Settings differ depending on whether the ligand transformation involves a change in net charge
   * Non charge changing transformations: 11 lambda windows, 5 ns production run per lambda window
   * Charge changing transformations: 22 lambda windows, 20 ns production run per lambda window
* Creating the ``AlchemicalTransformation``\ s for solvent and complex legs and saving them to disc as json files

In an environment with OpenFE 1.0 installed, please run this script by calling:

.. code-block:: bash

   # If you don’t have cofactors
   python plan_rbfe_network.py --pdb protein.pdb --ligands ligands.sdf --output network_setup

   # If you have cofactors
   python plan_rbfe_network.py --pdb protein.pdb --ligands ligands.sdf --cofactors cofactors.sdf --output network_setup

This command will create a folder (named ``network_setup`` as specified using the ``--output`` flag) that contains a separate ``.json`` file for the solvent and complex legs 
for every edge in the network. The folder also contains a ``ligand_network.graphml`` file that is a serialized version of the ``LigandNetwork``.

.. warning::
   Since the partial charge assignment can be slow, we recommend putting the planning command in a bash script and executing it on a high performance workstation or HPC resource. 

.. _simulation_execution:

Simulation Execution
====================

All planned simulations will be run by industry partners on their own clusters using OpenFE execution tooling,
i.e. through the `quickrun method <https://docs.openfree.energy/en/latest/guide/execution/quickrun_execution.html>`_.
You can find additional information and examples on how to run simulations of the entire network in the "Running the simulations" section of our `CLI tutorial <https://docs.openfree.energy/en/latest/tutorials/rbfe_cli_tutorial.html>`_.
In this study, we will use a slightly modified approach to our CLI Tutorial, allowing for execution of a single task per HPC job.

Here is an example of a very simple script that will create and submit a separate job script (`*.job` named file) for every alchemical transformation (for the simplest SLURM use case):

.. code-block:: bash

   for file in network_setup/transformations/*.json; do
     relpath="${file:30}"  # strip off "network_setup/"
     dirpath=${relpath%.*}  # strip off final ".json"
     jobpath="network_setup/transformations/${dirpath}.job"
     if [ -f "${jobpath}" ]; then
       echo "${jobpath} already exists"
       exit 1
     fi
     for repeat in {0..2}; do
       cmd="openfe quickrun ${file} -o results_${repeat}/${relpath} -d results_${repeat}/${dirpath}"
       echo -e "#!/usr/bin/env bash\n${cmd}" > "${jobpath}"
       sbatch "${jobpath}"
     done 
   done

Please reach out to the openfe team if you have any questions on how to adapt this script to your internal needs, we would be happy to assist with this.


.. _failed_edges:

Handling Failed Edges
=====================

It is possible that some of the simulations in the network fail.
More specifically, not all repeats of an edge may finish successfully, or an edge can fail entirely.
You should follow this strategy for dealing with those failures:

**1. Non-reproducible failures**

* When at least one repeat of the edge completed successfully
* Keep a log of the failure
* Rerun the failed job(s) up to 3 times.
* If the simulation repeat is still failing after 3 times, and the failure is due to NaN errors, then treat this as a **reproducible failure**.

**2. Reproducible failures**

* When all repeats of an edge failed
* Keep a log of the failure
* Do not rerun this edge
* If this is a redundant edge:

  * Remove this edge from the network and carry out the analysis without this edge

* If this is a non-redundant edge (meaning that removing this edge would lead to a disconnected graph):

  * Add a new edge to the network. We provide a script that will automatically find a suitable new edge, more information can be found :ref:`here <fix_networks>`.
 
Identifying Failed Edges
------------------------

Failed edges can be identified in a few different ways:

1. Assuming you have no jobs running, there should be a 1-1 mapping between result ``json`` files and job work directories, for example:

.. code-block:: bash

   $ ls
   easy_rbfe_lig_ejm_31_complex_lig_ejm_46_complex/
   easy_rbfe_lig_ejm_31_complex_lig_ejm_46_complex.json
   easy_rbfe_lig_ejm_31_complex_lig_ejm_47_complex/
   easy_rbfe_lig_ejm_31_complex_lig_ejm_47_complex.json
   easy_rbfe_lig_ejm_31_complex_lig_ejm_48_complex/
   easy_rbfe_lig_ejm_31_complex_lig_ejm_48_complex.json
   easy_rbfe_lig_ejm_31_complex_lig_ejm_50_complex/
   easy_rbfe_lig_ejm_31_complex_lig_ejm_50_complex.json
   easy_rbfe_lig_ejm_31_solvent_lig_ejm_42_solvent/
   easy_rbfe_lig_ejm_31_solvent_lig_ejm_42_solvent.json

If a directory like ``easy_rbfe_lig_ejm_31_solvent_lig_ejm_42_solvent/`` exists but the corresponding ``json`` file doesn't (``easy_rbfe_lig_ejm_31_solvent_lig_ejm_42_solvent.json``) then we know the edge failed.
The directory should be removed and the job should be resubmitted (depending on the failure type as discussed above).

2. The extract results script mentioned :ref:`here <inspecting results>` and the cleanup script mentioned :ref:`here <post-simulation cleanup>` both include output of which folder(s) and ``json`` file(s) contain errors and can be removed prior to starting new jobs.

.. _fix_networks:

Fixing broken networks
----------------------

If removing a reproducibly failing edge leads to a disconnected graph, new edges need to be added in order to fix the broken network.
There are different ways to detect broken networks:

* You experienced failures during the plotting script or the script for extracting results (e.g. ``ValueError: Computational results are not fully connected``)
* The number of ligands in the output ``dg.tsv`` file of the ``extract_results.py`` script is lower than the number of ligands the dataset should have

Fixing the disconnected network can be achieved using the script provided under
`utils/fix_networks.py <https://github.com/OpenFreeEnergy/IndustryBenchmarks2024/tree/main/industry_benchmarks/utils/fix_networks.py>`_.

The script finds new edges to repair the disconnected network
(two edges per disconnected network) and produces the inputs necessary to run those additional edges.

Note that the script will throw an error if at least one repeat completed successfully. In that case (non-reproducible failure) we recommend re-running the failed jobs (see :ref:`Handling failed edges <failed_edges>`

Here an example of how to run the script:

.. code-block:: bash

   python fix_networks.py --input_alchem_network_file network_setup/alchemicalNetwork/alchemical_network.json --result_files results_*/*json --output_extra_transformations new_network_setup

The script takes as inputs the ``AlchemicalNetwork`` from the original setup (e.g. ``network_setup/alchemicalNetwork/alchemical_network.json``), the result ``.json`` files (e.g. ``results_*/*json``, and the folder name for storing the outputs (e.g. ``new_network_setup``).
This command will create a folder (named ``new_network_setup`` as specified using the ``--output_extra_transformations`` flag) that contains a ``transformations`` folder with a separate .json file for the solvent and complex legs for every new edge connecting the previously broken network.
The ``new_network_setup`` folder also contains a ``ligand_network.graphml`` file as well as an ``alchemical_network.json`` file containing the serialized version of the new ``LigandNetwork`` and ``AlchemicalNetwork``. These networks only contain the additional edges and not the full new network.

Once the inputs for the edges to fix the broken network have been created, you can submit those calculations as described in the :ref:`Simulation execution section <simulation_execution>`.
Note that you will have to update the filepath to point to the new input .json files, e.g.

.. code-block:: bash

   for file in new_network_setup/transformations/*.json; do
     relpath="${file:34}"  # strip off "new_network_setup/"
     dirpath=${relpath%.*}  # strip off final ".json"
     jobpath="new_network_setup/transformations/${dirpath}.job"
     if [ -f "${jobpath}" ]; then
       echo "${jobpath} already exists"
       exit 1
     fi
     for repeat in {0..2}; do
       cmd="openfe quickrun ${file} -o results_${repeat}/${relpath} -d results_${repeat}/${dirpath}"
       echo -e "#!/usr/bin/env bash\n${cmd}" > "${jobpath}"
       sbatch "${jobpath}"
     done
   done


Inspecting Results
==================

.. _inspecting results:


.. note::
   A separate script is provided for :ref:`gathering relevant FE output data <gathering_of_results>` in Phase 3.


Due to the slightly modified simulation execution layout, using `openfe gather` will not work in openfe v1.0.1.

Instead we recommend using the scripts we bundle in this repository.

**Extracting ddG results**

You can extract the ddG results in the following manner from the directory containing `results_0`, `results_1`, and `results_2`.


.. code-block:: bash

   wget https://raw.githubusercontent.com/OpenFreeEnergy/IndustryBenchmarks2024/main/industry_benchmarks/utils/extras/extract_results.py
   python extract_results.py --input_ligand_network_file network_setup/ligand_network.graphml


This will create a file named `ddg.tsv` from the individual simulation repeats.

If the simulations are complete, you can plot the `ddg.tsv` results
against the Schrodinger results downloaded from the relevant systems' ligand prediction file
from: https://github.com/schrodinger/public_binding_free_energy_benchmark/tree/main/21_4_results/ligand_predictions

For example, for the JACS TYK2 set:


.. code-block:: bash

   wget https://raw.githubusercontent.com/schrodinger/public_binding_free_energy_benchmark/v2.0/21_4_results/ligand_predictions/jacs_set/tyk2_out.csv
   wget https://raw.githubusercontent.com/OpenFreeEnergy/IndustryBenchmarks2024/main/industry_benchmarks/utils/extras/plot_results.py
   python plot_results.py --calculated ddg.tsv --experiment tyk2_out.csv


This will automatically generate plots of OpenFE vs experiment and FEP+ vs experiment using cinnabar.

.. _post-simulation cleanup:

Simulation Cleanup
==================

The post-simulation cleanup script will reduce the amount of data you need to store after your simulations.
It does not delete any data required for analysis.
To obtain the script, download it with:

.. code-block:: bash

   $ curl -LOJ https://raw.githubusercontent.com/OpenFreeEnergy/IndustryBenchmarks2024/main/industry_benchmarks/utils/results_cleanup.py

The cleanup script requires the path to the ``result.json`` file and also accepts a list of ``result.json`` files.
Additionally, the script will detect if the result data has already been reduced and will skip processing that file.
This means that you can run the script repeatedly with a glob as results are finished.
For example:

.. code-block:: bash

   $ micromamba activate openfe-benchmark  # don't forget to activate your conda environment first!
   $ python results_cleanup.py path/to/results/*.json

.. warning::
   Do not move data around before running the script.
   The script assumes that the result directory has not been moved.

.. warning::
   Do not run multiple instances of the script at once using a wild card glob, i.e. ``*.json``.
   The script does not use file locks to ensure multiple instances are not operating on the same file.

To save space as simulations complete, consider adding a line at the end of your job submission script that runs the cleaning script.

.. code-block:: bash
   
   openfe quickrun "${file}" -o results_"${repeat}"/"${relpath}" -d results_"${repeat}"/"${dirpath}"
   python results_cleanup.py results_"${repeat}"/"${relpath}"

The cleanup script will delete:

* Redundant structural analysis data
* Redundant input data
* Simulation ``.nc`` & checkpoint ``.chk`` files 

.. note:: 
   The simulation ``.nc`` file is subsampled and for each lambda window, an ``.xtc`` trajectory file is created


Compute Requirements
====================

To run the benchmark simulations following **GPU hardware** will be required:

* An NVIDIA GPU (CUDA 10.2 or above compatible)
    * In non-exclusive compute mode
* A single GPU ID assigned per `openfe quickrun` execution
    * e.g. by setting `CUDA_VISIBLE_DEVICE` if necessary
* Estimated **standard** transformation compute time:
    * Approximately 8-12 GPU hours per complex transformation repeat
    * Approximately 1-2 GPU hours per solvent transformation repeat
* Estimated **net charge** transformation compute time:
    * Approximately 4-7 GPU days per complex transformation repeat
    * Approximately 8-12 GPU hours per solvent transformation repeat


Data Storage Requirements
=========================

.. _data storage requirements:

.. warning::
   You will need to keep any temporary data until the `post-simulation cleanup`_ script is made available.


**Temporary data:**

*Retention time:* until the `post-simulation cleanup`_ script is applied to your data.

*Estimated storage costs:*
  * Standard simulations: **5-10 GB** per triplicate repeat cycle
  * Net charge transformation: **40-80 GB** per triplicate repeat cycle

This data contains the full simulation outputs, including large netcdf trajectories with 1 ps snapshots of the coordinates and energies. A cleanup script will be provided to extract relevant long-term data from these outputs.

**Long term data:**

*Retention time:* until the benchmark manuscript is accepted for publication.
*Estimated storage costs:* **sub 500 mb** per triplicate repeat cycle.

This long term data will be extracted from above-mentioned temporary data using the `post-simulation cleanup`_ script.

It will include:
  * Reduced potential arrays for free energy analysis
  * OpenFE output JSON files
  * PNGs from structural analysis
  * XTC coordinate trajectories for each lambda window containing 21 evenly spaced frames

The data will be kept around by each partner for further processing should it be necessary as part of the manuscript writing process.

.. _public_phase3:

Phase 3: Data Analysis
**********************


In this phase, relevant simulation results will be gathered from industry partners.

**Start date:** *Early November 2024*

**End date:** *Early December 2024*


Industry partners are expected to post-process simulation outputs using a specialized script provided by the OpenFE team.

This script extracts:

* Relevant free energy estimates, including time series of free energies
* Simulation health metrics

  * Overlap matrix and replica exchange probability plots
  * Relevant structural analysis plots

* Additional transformation information

  * Additional metrics, including blinded transformation and ligand scores.

Industry partners are expected to submit this information back to OpenFE for
analysis. A list of :ref:`gathered data <gathered_data>` is provided
to help with data review ahead of submission.


.. _gathering_of_results:

Gathering of results
====================


.. note::
  Make sure to run the :ref:`post-simulation cleanup` script before running the data gathering script!
  If you have not run the cleanup script, this data gathering script will throw an error.


.. note::
  The data gathering script will throw an error if the network is broken or if any of the three repeats have not finished successfully.

    * If you have a broken network, please run the `fix_networks`_ script and any newly generated edges to obtain a connected network. You will then need to use the `--fixed_alchemical_network` option (see below) when calling the script.
    * If some of your repeats have failed, please re-run those as described `here <failed_edges>`_


.. warning::
  By default the gathering script does not replace ligand names!
  For the public datasets, we want to use the same ligand names as in the input files.
  For the private datasets, if you used proprietary names, please use the ``--hide-ligand-names`` flag.


To perform the data gathering, do the following:

.. code-block:: bash

   wget https://raw.githubusercontent.com/OpenFreeEnergy/IndustryBenchmarks2024/main/industry_benchmarks/utils/data_gathering.py
   python data_gathering.py --input_alchemical_network network_setup/alchemical_network.json --results-folder results_0 --results-folder results_1 --results-folder results_2

If the `fix network script <fix_networks>`_ was run, two alchemical networks need to be provided to the data gathering script, the original one and the one that was created after running the ``fix_networks.py``:

.. code-block:: bash

   python data_gathering.py --input_alchemical_network network_setup/alchemicalNetwork/alchemical_network.json --fixed_alchemical_network new_network_setup/alchemicalNetwork/alchemical_network.json --results-folder results_0 --results-folder results_1 --results-folder results_2

If confidential ligand names were used (private datasets), those can be replaced with generic ligand names. To do that you will need to add the flag ``--hide-ligand-names``:

.. code-block:: bash

   python data_gathering.py --input_alchemical_network network_setup/alchemicalNetwork/alchemical_network.json --results-folder results_0 --results-folder results_1 --results-folder results_2 --hide-ligand-names

.. warning::
   Per default, the ligand names that were present in the original ligand sdf will be stored. Please check those and if they contain any confidential information, use the ``hide-ligand-names`` flag.


.. _gathered_data:

Content of the data that will be transferred to OpenFE
======================================================

Here you can find the full list of files and data that will be extracted in the `data gathering script <gathering_of_results>`_.


1. Network properties JSON file
-------------------------------


.. note::
   All ligand properties gathered are, to best of our knowledge, blinded and cannot be used to recover the underlying structures.


A file containing a variety of network properties (``all_network_properties.json``).
More specifically, the file contains the following information:

* ``Edges``: Following information is saved for each transformation:

  * Lomap score
  * Alchemical charge difference
  * Charge score: A score that describes the difficulty of a transformation related to the absolute change difference between ligands
  * Shape score
  * Volume score
  * Mapping RMSD score
  * Number of heavy atoms in the common core
  * Number of heavy atoms in the dummy region of ligand A
  * Number of heavy atoms in the dummy region of ligand B
  * Changing number of rings
  * Changing number of rotatable bonds
  * Morgan fingerprint Tanimoto similarity
  * AtomPair fingerprint dice similarity
  * Topological torsion fingerprint dice similarity
  * Difference in solvent accessible surface area
  * Whether or not the simulation failed

* ``Ligands``: Following information is saved for each ligand:

  * Number of rotatable bonds
  * Number of ring systems
  * Number of heavy atoms
  * Number of elements
  * Solvent accessible surface area
  * Formal charge

* ``DDG_estimates``: For each leg and each repeat, the MBAR DDG estimate and its uncertainty are stored

An example extract from the ``all_network_properties.json`` file is shown for the bace test system:

.. code-block:: bash

    {
      "Ligands": {
        "spiro1": {
          "num_rotatable_bonds": 1,
          "num_rings": 4,
          "num_heavy_atoms": 26,
          "num_elements": 5,
          "solvent_accessible_surface_area": 387.43906527976293,
          "formal_charge": 1
        },
        "spiro2": {
          "num_rotatable_bonds": 1,
          "num_rings": 4,
          "num_heavy_atoms": 24,
          "num_elements": 5,
          "solvent_accessible_surface_area": 365.53534383347056,
          "formal_charge": 1
        }
      },
      "Edges": {
        "edge_spiro2_spiro1": {
          "ligand_A": "spiro2",
          "ligand_B": "spiro1",
          "lomap_score": 0.7788007830714049,
          "alchemical_charge_difference": 0,
          "charge_score": 1.0,
          "shape_score": 0.2709510437018804,
          "volume_score": 0.940014662311061,
          "mapping_rmsd_score": 0.28410096856069245,
          "num_heavy_core": 24,
          "num_heavy_dummy_A": 0,
          "num_heavy_dummy_B": 2,
          "difference_num_rings_AB": 0,
          "difference_num_rot_bonds_AB": 0,
          "morgan_tanimoto_similarity": 0.6966292134831461,
          "difference_solvent_accessible_surface_area": 21.903721446292366,
          "atom_pair_dice_similarity": 0.899527983816588,
          "topological_torsion_dice_similarity": 0.9166666666666666
        }
      },
      "DDG_estimates": {
        "solvent_spiro2_spiro1_repeat_0": [
          {
            "magnitude": 76.98924715451133,
            "unit": "kilocalorie_per_mole",
            ":is_custom:": true,
            "pint_unit_registry": "openff_units"
          },
          {
            "magnitude": 0.45136662410654116,
            "unit": "kilocalorie_per_mole",
            ":is_custom:": true,
            "pint_unit_registry": "openff_units"
          }
        ],
        "solvent_spiro2_spiro1_repeat_1": [...


2. Simulation files (reduced potentials, system information, and automated analyses)
------------------------------------------------------------------------------------

.. note::
   None of the files gathered contain any information which could be used to extract the underlying system's chemistries.


For each transformation and each repeat, a folder is created that contains following files:

* ``simulation_real_time_analysis.yaml``: contains cumulative free energy estimates and simulation run time information
* ``info.yaml``: number of atoms, run time information (nanoseconds per day)
* ``ligand_RMSD.txt``: time series values for the ligand RMSD from the initial frame for each replicate
* ``ligand_wander.txt``: time series values for the ligand center of mass drift from its initial frame position for each replicate
* ``protein_RMSD.txt``: time series values for the protein RMSD from the initial frame for each replicate
* ``protein_2D_RMSD.txt``: time series values for the protein RMSD measured between every frame for each replicate
* ``u_ln.txt``: time series values for the reduced potential for each replicate
* ``N_l.txt``: the number of samples drawn from each replicate state
* ``replicas_state_indices.txt``: time series values of the replica state
* ``time_ps.txt``: the total simulation time in picoseconds at each sample
* If present, following image analysis files are collected:

  * ``forward_reverse_convergence.png``: a time censured plot of the free energy estimates
  * ``ligand_COM_drift.png``: the ligands center of mass (COM) distance from its initial frame position
  * ``ligand_RMSD.png``: the RMSD of the ligand from the initial frame
  * ``mbar_overlap_matrix.png``: the MBAR overlap matrix
  * ``protein_2D_RMSD.png``: the 2D RMSD of the protein alpha carbons
  * ``replica_exchange_matrix.png``: the replica exchange matrix
  * ``replica_state_timeseries.png``: a plot of the replica state timeseries

The results directory should be organised as follows:

.. code-block:: bash

    results_data
    ├── all_network_properties.json
    ├── complex_spiro2_spiro1_repeat_0
    │   ├── ligand_RMSD.txt
    │   ├── ligand_wander.txt
    │   ├── protein_RMSD.txt
    │   ├── protein_2D_RMSD.txt
    │   ├── u_ln.txt
    │   ├── N_l.txt
    │   ├── replicas_state_indices.txt
    │   ├── info.yaml
    │   ├── simulation_real_time_analysis.yaml
    │   └── time_ps.txt
    ├── solvent_spiro2_spiro1_repeat_0
    │   ├── ligand_RMSD.txt
    │   ├── ligand_wander.txt
    │   ├── protein_RMSD.txt
    │   ├── protein_2D_RMSD.txt
    │   ├── u_ln.txt
    │   ├── N_l.txt
    │   ├── replicas_state_indices.txt
    │   ├── info.yaml
    │   ├── simulation_real_time_analysis.yaml
    │   └── time_ps.txt
    ...

A zip archive of this data will also be created in the output folder called ``results_data.zip`` which should be uploaded to zenodo.

.. _upload_of_results:

Uploading the results to zenodo
===============================

The results from this benchmarking study will be submitted to the OpenFE team via a zenodo upload.

To upload the data, please follow these steps:

1. Run the :ref:`post-simulation cleanup` script.
2. Run the :ref:`data gathering script <gathering_of_results>`. The output of this step should be a compressed folder named ``results_data.zip``.
3. Create a directory for each dataset, naming it after the dataset, e.g. ``jacs_bace``. Then, move the  ``results_data.zip`` file into the respective dataset folder. Please make sure to include both the set name (e.g. jacs or waterset) and the target name (e.g. bace) since some targets occur in multiple sets.
   For private datasets, please also copy the CSV file with the experimental data and the private info form into this same folder (see :ref:`here <private-data-gather>` for further instructions).

.. code-block:: bash

    mkdir jacs_bace
    mv results_data.zip jacs_bace
    zip -r jacs_bace.zip jacs_bace

4. Upload the dataset zipped folder (that contains the compressed results) to zenodo.

  a) Login to zenodo or sign up if you do not have an account.
  b) Upload the data to the **Open Free Energy Zenodo Community**. You can find the OpenFE community here: `https://zenodo.org/communities/openfreeenergy/records?q=&l=list&p=1&s=10&sort=newest <https://zenodo.org/communities/openfreeenergy/records?q=&l=list&p=1&s=10&sort=newest>`_.
  c) Click the ``New upload`` button on the top right.
  d) Fill out all the relevant information.

     * Digital Object Identifier: Click ``No`` and ``Generate a DOI now!``
     * Resource type: ``Dataset``
     * Title: For example something like: OpenFE 2024 Industry Benchmarking Project, Results from X (where X could be company name, or dataset name, or author.
     * Creators: Please add as authors everyone involved in running the calculations.
     * Description: Please report the OpenFE version that was used. You can get this information by running ``openfe --version`` in the commandline within the environment that you used to run the setup, simulations, and analysis of the data. In addition to that, please reference the `Industry Benchmark repo <https://github.com/OpenFreeEnergy/IndustryBenchmarks2024>`_ as
       it contains all necessary information about the simulation details.
       E.g.: OpenFE 2024 Industry Benchmarking Project, Results from systems `jacs set; bace` and `merck set; shp2`. OpenFE version v1.0.1.  Simulation details can be found in the GitHub repository `https://github.com/OpenFreeEnergy/IndustryBenchmarks2024`

  e) Add the zipped dataset folder (e.g. ``jacs_bace.zip``) that contains the compressed results folder (``results_data.zip``) to the zenodo upload. Please upload a separate zipped folder for each dataset.
  f) Click the ``Submit for review`` button on the top right. We will then review the upload and publish it.

If you plan to submit the results from multiple datasets in a single zenodo upload, please make sure to create subfolders for the different datasets
and to name them according to the set name and target name. Additionally, please list all the sets contained in the upload in the `Description` section.

.. note::
  For the private datasets you will need to upload two additional files to zenodo, a CSV file with experimental data and
  the :ref:`private dataset info form <private info form>`. You can find the instructions for uploading the private datasets :ref:`here <private-data-gather>`.


Analysis of results
===================

Once submitted, the datasets will be analyzed by the OpenFE development team with
help from volunteering industry board and technical advisory committee members.


Phase 4: Paper writing
**********************


.. note::
   Details for phase 3 of the public dataset benchmarks are still being finalised. These will be updated as soon as possible!


**Start date:** *Early November 2024*

**End date:** *December 2024*


Drafting the manuscript
=======================

Once all results have been gathered, the OpenFE team alongside volunteer members of
the OpenFE board and technical advisory committee will draft a manuscript for open access
publication at a relevant journal.

Review of manuscript and authorship
===================================

All authors will be expected to review and approve the manuscript prior to journal submission.
We anticipate doing this in a two round process, the first round where authors are invited
to comment on the manuscript, followed by a second one for legal review by each partner organization.

Authorship will be offered to all those involved in the benchmarking process and inclusion will be
left to the discretion of each organization.