Use Prolog to improve LLM's reasoning

How 15 Top LLMs perform on classification: accuracy vs. cost breakdown

DeepSeek: Advancing theorem proving in LLMs through large-scale synthetic data

Using LLMs to enhance our testing practices

Apple study proves LLM-based AI models are flawed because they cannot reason

Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs

LLMs, Theory of Mind, and Cheryl's Birthday

First class Prompt Engineering with llm lang! (This is a bad idea.)

Understanding the Limitations of Mathematical Reasoning in LLMs

Pythagora: Auto-Generate Node.js Tests Using LLMs, No Coding Required

A Startup CTO's thoughts on LLMs writing Code

A Selective Survey of Efficient Speculative Decoding Techniques for LLM Inference

Llamafile for Meltemi: The First LLM for Greek

Researchers found a way to generate structurally sound and functionally relevant RNA sequences using an LLM, potentially advancing applications such as therapeutic and targeted RNA design

Generated Checklists Improve LLM Evaluation and Generation

Johnny LLM Can't Read Code

Launch HN: Integuru (YC W24) – Reverse-engineer internal APIs using LLMs

LLM inference library written in Rust with mistral.rs

SmartCat: my attempt at making the most efficient LLM tool for terminal dwellers

You can now run prompts against images, audio and video in your terminal using LLM

PixelVerse t1 – CoT prompting outperforms flagship LLMs

The Prompt() Function: Use the Power of LLMs with SQL

Show HN: We wrote a book on LLM system evals with a bear and fox

Microsoft BitNet: inference framework for 1-bit LLMs

Running LLMs with 3.3M Context Tokens on a Single GPU

NVLM: Open Frontier-Class Multimodal LLMs

oss-fuzz-gen: LLM powered fuzzing via OSS-Fuzz

Lm.rs: Minimal CPU LLM inference in Rust with no dependency

Local TypeScript Super SDK to Call 200 LLMs

OpenAPI definitions, converters and LLM function calling application composer
