81.8K Downloads Updated 6 months ago
Athene-V2 is a 72B parameter model which excels at code completion, mathematics, and log extraction tasks.
tools
72b
Models
View all →Readme
Athene-V2
Nexusflow’s Athene-V2 chat model, built on Qwen 2.5’s 72B foundation, achieves GPT-4o-level performance across key benchmarks while demonstrating how targeted optimization can enhance specific capabilities beyond traditional scaling approaches.
Model Features
- 72B parameters fine-tuned from Qwen 2.5
- State-of-the-art chat performance matching or exceeding GPT-4o
- Superior code completion (ranking #2 on bigcode-bench-hard)
- Enhanced mathematics capabilities (MATH benchmark)
- Precise long-form log extraction
- Advanced post-training pipeline pushing the Pareto frontier