Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
SushiHippie
28 days ago
|
parent
|
context
|
favorite
| on:
Llama3 running locally on iPhone 15 Pro
Seems like it is the 3bit quantized version? (Judging by the file size)
And does anyone know how many tokens per second it can run?
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
And does anyone know how many tokens per second it can run?