We’ll have small local models beating gpt-4/Claude opus in 2024. We already have sub 100b models trading blows with former gpt-4 models, and the future is racing toward us. All these little breakthroughs are piling up.
"Small" leaves wiggle room, but it's extremely unlikely a traditionally small model, <= 7B, will get there this year, even on these evals.
UX matching is a whole different matter and needs a lot of work. I've worked heavily with Llama 8B over the last few days, and Phi 3 today, and the Q+A benchmarks don't tell the full story. For example: it's nigh impossible to get even Llama _70_B to answer in JSON, and when Phi sees RAG context from search, it goes off inventing new RAG material and a new question.
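In practice you end up wrapping these models in validation-and-retry glue. A minimal sketch of what that looks like, assuming a hypothetical `generate` callable standing in for whatever local-model API you're using (names here are illustrative, not from any specific library):

```python
import json

def extract_json(text):
    # Small models often wrap JSON in prose or code fences; grab the
    # outermost {...} span and try to parse it.
    start = text.find("{")
    end = text.rfind("}")
    if start == -1 or end <= start:
        return None
    try:
        return json.loads(text[start:end + 1])
    except json.JSONDecodeError:
        return None

def ask_for_json(generate, prompt, retries=3):
    # `generate` is a stand-in for your model call (llama.cpp, Ollama, etc.).
    # Re-prompt until we get parseable JSON or run out of retries.
    for _ in range(retries):
        reply = generate(prompt + "\nAnswer ONLY with a single JSON object.")
        parsed = extract_json(reply)
        if parsed is not None:
            return parsed
    return None
```

Grammar-constrained decoding (where the runtime supports it) is the sturdier fix, but a parse-and-retry loop like this is the lowest-effort patch when the model mostly complies.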