Hacker News
The Matrix: A Bayesian learning model for LLMs (arxiv.org)
139 points by stoniejohnson 24 days ago | past | 10 comments
CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions (arxiv.org)
43 points by tosh 24 days ago | past | 1 comment
A Primer on the Inner Workings of Transformer-Based Language Models (arxiv.org)
4 points by jonbaer 24 days ago | past
Polarization entangled photons over NYC fiberoptic network (arxiv.org)
1 point by cookingrobot 25 days ago | past
Porting HPC Applications to AMD MI300A Using Unified Memory and OpenMP (arxiv.org)
2 points by latchkey 25 days ago | past
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding (arxiv.org)
78 points by PaulHoule 25 days ago | past
LoRA Land: 310 Fine-Tuned LLMs That Rival GPT-4, a Technical Report (arxiv.org)
3 points by milliondreams 25 days ago | past
Kolmogorov-Arnold Networks, an alternative to Multi-Layer Perceptrons (arxiv.org)
2 points by thatxliner 25 days ago | past
The Matrix: A Bayesian learning model for LLMs (arxiv.org)
3 points by smaddox 25 days ago | past
Network reconstruction via the minimum description length principle (arxiv.org)
2 points by Anon84 25 days ago | past
Mapping the Increasing Use of LLMs in Scientific Papers (arxiv.org)
3 points by rntn 25 days ago | past
Technosignatures Longevity and Lindy's Law (arxiv.org)
1 point by belter 25 days ago | past
Kolmogorov–Arnold Networks: Alternative to Multilayer Perceptrons (arxiv.org)
2 points by georgehill 25 days ago | past | 3 comments
Examination of Large Language Model Performance on Grade School Arithmetic (arxiv.org)
2 points by s-macke 25 days ago | past
Evoke: Emotion Enabled Virtual Avatar Mapping Optimized Knowledge Distillation (arxiv.org)
1 point by sandwichukulele 26 days ago | past | 1 comment
An Examination of Large Language Model Performance on Grade School Arithmetic (arxiv.org)
8 points by kmdupree 26 days ago | past | 1 comment
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs (arxiv.org)
2 points by zerojames 26 days ago | past
Scalable Bayesian Inference in the Era of Deep Learning (arxiv.org)
1 point by georgehill 26 days ago | past
The Metaverse Within Everyday Environments: A Coarse-to-Fine Approach (arxiv.org)
2 points by PaulHoule 26 days ago | past
Capabilities of Gemini Models in Medicine (arxiv.org)
1 point by tosh 26 days ago | past
Alice's Adventures in a Differentiable Wonderland (arxiv.org)
1 point by Schiphol 26 days ago | past | 2 comments
The Electromagnetic Mass Dilemma (arxiv.org)
2 points by raattgift 26 days ago | past | 1 comment
The Feasibility of Implementing Large-Scale Transformers on Multi-FPGA Platforms (arxiv.org)
3 points by PaulHoule 26 days ago | past
Shape of Money Laundering: Subgraph Representation Learning on the Blockchain (arxiv.org)
2 points by Anon84 26 days ago | past
Is Model Collapse Inevitable? (arxiv.org)
5 points by tosh 26 days ago | past | 1 comment
KAN: Kolmogorov-Arnold Networks (arxiv.org)
4 points by hardmaru 26 days ago | past
A Careful Examination of LLM Performance on Grade School Arithmetic (arxiv.org)
2 points by GaggiX 26 days ago | past
Detection of Depressive Episodes Through Pupillary Response in the Wild (arxiv.org)
1 point by PaulHoule 27 days ago | past
Modeling Dynamic (De)Allocations of Local Memory for Translation Validation (arxiv.org)
2 points by luu 27 days ago | past
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking (arxiv.org)
1 point by jonbaer 27 days ago | past
