Wang, Yiping, Jing Wang, Junhao Zhu, Fengyao Zhai, Hu Zhu, Ziwei Dai, Zengru Di, Da Zhou, and Yu Liu. “Compression-Based Tokenization Improves Language Modeling of Hierarchical Genomic Structure”. LangTaoSha Preprint Server, December 11, 2025. Accessed February 5, 2026. https://www.langtaosha.org.cn/index.php/lts/preprint/view/51.