Skip to content

xinwuye/MMSciBench-code

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MMSciBench

πŸ“‘  Paper | πŸ“Š  MMSciBench

License Stars Issues

πŸ“Œ Overview

MMSciBench focuses on mathematics and physics that evaluates scientific reasoning capabilities. This repository contains the code for the benchmark. The dataset is available on Hugging Face: MMSciBench Dataset.

πŸ“– Paper

If you use this benchmark in your research, please cite our paper:

@article{ye2025mmscibench,
  title={MMSciBench: Benchmarking Language Models on Chinese Multimodal Scientific Problems},
  author={Ye, Xinwu and Li, Chengfan and Chen, Siming and Wei, Wei and Tang, Xiangru},
  journal={Findings of the Association for Computational Linguistics: ACL 2025},
  year={2025}
}

πŸ›  Installation

Clone the repository:

git clone https://github.com/xinwuye/MMSciBench-code.git
cd MMSciBench-code

πŸ“Š Benchmark Dataset

The dataset for MMSciBench is available on Hugging Face:

πŸ”— MMSciBench Dataset

πŸš€ Usage

Running Evaluation

To evaluate models on the benchmark, use the following command:

python exp.py
python exp_hf.py
python eval1.py

πŸ“ˆ Results

Once evaluation is complete, results will be saved.

πŸ“œ License

This project is licensed under the Apache-2.0 License.

🀝 Contributing

We welcome contributions! To contribute:

  1. Fork the repository.
  2. Create a new branch: git checkout -b feature-branch-name.
  3. Commit your changes: git commit -m 'Add a new feature'.
  4. Push to the branch: git push origin feature-branch-name.
  5. Open a pull request.

πŸ”— Acknowledgments

We thank the open-source community and previous works that inspired this benchmark.

πŸ“¬ Contact

For questions or collaborations, please open an issue.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages