ZAYA1-8B: An 8B MoE Model with 760M Active Params Matching DeepSeek-R1 on Math
by steveharing1