Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
KingNishΒ 
posted an update 11 days ago
Post
4254
We trained an open-source Mythos like cybersecurity LLM for the Build Small Hackathon meet OpenMythos

Trained in two stages: SFT on ~1.84K filtered ArXiv cs.CR papers + real CVE data, then RLVR using paired with past vulnerabilities GitHub repos with a verifier model checking outputs against ground truth.

Trained on: H100s from Modal

The RLVR stage made the biggest difference responses got more precise and less prone to confusing similar vulnerability classes.

Everything is open:
πŸ€– Demo β†’ build-small-hackathon/OpenMythos
🧠 Model β†’ build-small-hackathon/OpenMythos
πŸ“¦ CVE Dataset β†’ build-small-hackathon/CVE_Vulnerailities_Detailed
πŸ“„ ArXiv Dataset β†’ himanshu17HF/ArvixImport-Filtered-Final

Try it out and let us know where it breaks πŸ™
This comment has been hidden
In this post