LLMs can't stop making up software dependencies and sabotaging everything

Sandbox MCP: Enable LLMs to run ANY code safely

Naur's "Programming as Theory Building" and LLMs replacing human programmers

Mellum Goes Open Source: A Purpose-Built LLM for Developers, Now on Hugging Face

LLMs can see and hear without any training

Local LLM inference – impressive but too hard to work with

Bamba: An open-source LLM that crosses a transformer with an SSM

Recursive LLM prompts

The State of Reinforcement Learning for LLM Reasoning

Teaching LLMs how to solid model

The Policy Puppetry Attack: Novel bypass for major LLMs

Can LLMs do randomness?

Can reinforcement learning for LLMs scale beyond math and coding tasks? Probably

Can LLMs earn $1M from real freelance coding work?

End-to-end private LLM inference

Teuken-7B-Base and Teuken-7B-Instruct: Towards European LLMs (2024)

LLMs for Engineering: Teaching Models to Design High Powered Rockets

LLM Benchmark for 'Longform Creative Writing'

12-factor Agents: Patterns of reliable LLM applications

Meaning Machine – Visualize how LLMs break down and simulate meaning

Tiny-LLM – a course of serving LLM on Apple Silicon for systems engineers

Llama 2 LLM on DOS

Gemini 2.5: The First LLM That Understands PDF Layouts

Does RL Incentivize Reasoning in LLMs Beyond the Base Model?

MooseAgent: A LLM Based Multi-Agent Framework for Automating Moose Simulation

Some security in LLM based apps

SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators

Show HN: Light like the Terminal – Meet GTK LLM Chat Front End

Privacy folks – what's your take on using LLMs at work?

Lossless LLM compression for efficient GPU inference via dynamic-length float
