chemistry
biology

Add ByteFF-GNN and ByteFF-Pol v2; keep legacy v1 at trained_models/, valid_data/

#3
by xux1ngyuan - opened
ByteFF-GNN/example_data/hessian_example.csv ADDED
The diff for this file is too large to render. See raw diff
 
ByteFF-GNN/example_data/hessian_example.h5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2181c999d064e856f342e69af9a8263158e5b8fcda42793817d43f90bd2d7575
3
+ size 48510784
ByteFF-GNN/example_data/torsion_example.csv ADDED
The diff for this file is too large to render. See raw diff
 
ByteFF-GNN/example_data/torsion_example.h5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:265907c1aca2f4e67f2777ccc99268a6d4ea9fe11513fa4c6a4de5a2879c082e
3
+ size 66131920
ByteFF-Pol/trained_models_v2/fftrainer_config_in_use.yaml ADDED
@@ -0,0 +1,67 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ model:
2
+ ff_block:
3
+ - act: gelu
4
+ out_mlp_dims:
5
+ - 256
6
+ - 4
7
+ post_mlp_dims:
8
+ - 256
9
+ - 256
10
+ - 1
11
+ pre_mlp_dims:
12
+ - 256
13
+ - 256
14
+ - 3
15
+ tanh_output: 20.0
16
+ type: MMBondedConj
17
+ - act: gelu
18
+ out_mlp_dims:
19
+ - 256
20
+ - 3
21
+ pre_mlp_dims:
22
+ - 256
23
+ - 256
24
+ - 3
25
+ type: ChargeVolume
26
+ - act: gelu
27
+ charge14: 0.5
28
+ disp_damping_factor: 120.0
29
+ fix_li_alpha: 1.0e-06
30
+ ind14: 0.5
31
+ li_damp_clip: 0.2
32
+ out_mlp_dims:
33
+ - 256
34
+ - 3
35
+ pol_damping_factor: 0.39
36
+ pre_mlp_dims:
37
+ - 256
38
+ - 256
39
+ - 3
40
+ s12: 1.5
41
+ type: Exp6Pol
42
+ vdw14: 0.5
43
+ graph_block:
44
+ feature_layer:
45
+ act: gelu
46
+ atom_embedding_dim: 32
47
+ edge_mlp_dims:
48
+ - 64
49
+ - 256
50
+ - 2
51
+ node_mlp_dims:
52
+ - 64
53
+ - 256
54
+ - 2
55
+ gnn_layer:
56
+ act: gelu
57
+ at_channels: 16
58
+ ffn_dims:
59
+ - 256
60
+ - 2
61
+ gnn_dims:
62
+ - 256
63
+ - 256
64
+ - 4
65
+ gnn_type: EGT
66
+ heads: 16
67
+ jk: cat
ByteFF-Pol/trained_models_v2/optimal.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:04d6d1472e75d2a514d29fb5b2ee4d2a96a259c5154fdfb30a1b13bd70968296
3
+ size 82200474
README.md CHANGED
@@ -6,28 +6,45 @@ tags:
6
  ---
7
  # ByteFF2
8
 
9
- This repository contains the model used for the paper [Bridging Quantum Mechanics to Organic Liquid Properties via a Universal Force Field](https://arxiv.org/abs/2508.08575)。
10
 
11
- [ByteFF-Pol](https://arxiv.org/abs/2508.08575) is a polarizable force field parameterized by a graph neural network (GNN), trained on high-level quantum mechanics (QM) data, thus eliminating the need for experimental calibration. ByteFF-Pol achieves exceptional accuracy in predicting the thermodynamic and transport properties of small-molecule liquids and electrolytes, outperforming SOTA traditional and ML force fields
 
 
12
 
13
  # Trained Models
14
- The `trained_models` folder contains the trained model for ByteFF-Pol and its corresponding configuration (.yaml) file.
 
 
 
 
15
 
16
  # How to use
17
  Code and examples are available in the [byteff2](https://github.com/ByteDance-Seed/byteff2) repository.
18
 
19
  ## Citation
20
- If you find ByteFF-Pol is useful for your research and applications, feel free to give us a star ⭐ or cite us using:
21
 
22
  ```bibtex
23
 
24
- @misc{zheng2025bridgingquantummechanicsorganic,
25
- title = {Bridging Quantum Mechanics to Organic Liquid Properties via a Universal Force Field},
26
- author = {Tianze Zheng and Xingyuan Xu and Zhi Wang and Xu Han and Zhenliang Mu and Ziqing Zhang and Sheng Gong and Kuang Yu and Wen Yan},
27
- year = {2025},
28
- eprint = {2508.08575},
29
- archivePrefix = {arXiv},
30
- primaryClass = {physics.comp-ph},
31
- url = {https://arxiv.org/abs/2508.08575}
32
  }
 
 
 
 
 
 
 
 
 
 
 
 
33
  ```
 
6
  ---
7
  # ByteFF2
8
 
9
+ This repository contains the model and example data used for ByteFF-GNN and ByteFF-Pol.
10
 
11
+ * [ByteFF-GNN](https://pubs.rsc.org/en/content/articlehtml/2025/sc/d4sc06640e) is a molecular mechanics force field parameterized by an edge-augmented, symmetry-preserving graph neural network (GNN), trained on large-scale high-level quantum mechanics (QM) data. ByteFF-GNN enables fast, one-pass prediction of bonded and non-bonded parameters for drug-like molecules, achieving SOTA accuracy across torsional energy profiles, relaxed geometries, conformational energies, and off-equilibrium energies and forces.
12
+
13
+ * [ByteFF-Pol](https://www.nature.com/articles/s41467-026-73566-3) is a polarizable force field parameterized by a graph neural network (GNN), trained on high-level quantum mechanics (QM) data, thus eliminating the need for experimental calibration. ByteFF-Pol achieves exceptional accuracy in predicting the thermodynamic and transport properties of small-molecule liquids and electrolytes, outperforming SOTA traditional and ML force fields.
14
 
15
  # Trained Models
16
+ * **ByteFF-Pol**: Two model versions are provided:
17
+ - `ByteFF-Pol/trained_models_v2/` — latest model (**recommended**).
18
+ - `trained_models/` — legacy v1 model, kept at the original path for backward compatibility (paired with `valid_data/`).
19
+ * **ByteFF-GNN**: Example training data in `ByteFF-GNN/example_data/`.
20
+
21
 
22
  # How to use
23
  Code and examples are available in the [byteff2](https://github.com/ByteDance-Seed/byteff2) repository.
24
 
25
  ## Citation
26
+ If you find ByteFF-Pol or ByteFF-GNN is useful for your research and applications, feel free to give us a star ⭐ or cite us using:
27
 
28
  ```bibtex
29
 
30
+ @article{zheng2026bridging,
31
+ title = {Bridging quantum mechanics to liquid properties via a universal organic force field},
32
+ author = {Tianze Zheng and Xingyuan Xu and Zhi Wang and Zhenze Yang and Yuanheng Wang and Xu Han and Lei Chen and Zhenliang Mu and Ziqing Zhang and Siyuan Liu and Sheng Gong and Kuang Yu and Wen Yan},
33
+ year = {2026},
34
+ journal = {Nature Communications},
35
+ doi = {10.1038/s41467-026-73566-3},
36
+ url = {https://www.nature.com/articles/s41467-026-73566-3}
 
37
  }
38
+
39
+ @Article{D4SC06640E,
40
+ author = {Tianze Zheng and Ailun Wang and Xu Han and Yu Xia and Xingyuan Xu and Jiawei Zhan and Yu Liu and Yang Chen and Zhi Wang and Xiaojie Wu and Sheng Gong and Wen Yan},
41
+ title = {Data-driven parametrization of molecular mechanics force fields for expansive chemical space coverage},
42
+ journal = {Chem. Sci.},
43
+ year = {2025},
44
+ pages = {-},
45
+ publisher = {The Royal Society of Chemistry},
46
+ doi = {10.1039/D4SC06640E},
47
+ url = {http://dx.doi.org/10.1039/D4SC06640E}
48
+ }
49
+
50
  ```
valid_data/dataset_config.yaml CHANGED
@@ -4,8 +4,7 @@ confdata:
4
  data_cls: ClusterData
5
  kwargs:
6
  max_n_confs: 20
7
- meta_fp: /mnt/bn/byteff-dev-large/users/zhengtianze/byteff_v2/meta.txt
8
  moldata: {}
9
- save_dir: /root/byteff2-community/byteff2/valid_data
10
  shards: 1
11
- timestamp: '25_11_14_11_41_09'
 
4
  data_cls: ClusterData
5
  kwargs:
6
  max_n_confs: 20
 
7
  moldata: {}
8
+ save_dir: ../../../byteff2/valid_data
9
  shards: 1
10
+ timestamp: '26_06_16_13_23_12'
valid_data/processed_data_shard0.pkl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4b34d1397f8871ebee9d20cd1c69f41115ebe1bb6c7c3b61e741fbf23005f1f2
3
- size 110560423
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c316628a2185cd029542cebd40dcb5754ab0bb2bd841060a7b7c4140a0bd5e7f
3
+ size 111042000