Loading...

Tag trends are in beta. Feedback? Thoughts? Email me at [email protected]

We need data engineering benchmarks for LLMs

LLM Prompt Engineering

Task-specific LLM evals that do and don't work

LLM abstraction levels inspired by fish eye lens

Automated reasoning to remove LLM hallucinations

Training LLMs to Reason in a Continuous Latent Space

Test Driven Development (TDD) for your LLMs? Yes please, more of that please

AI hallucinations: Why LLMs make things up (and how to fix it)

Microsoft Bets $10K on Prompt Injection Protections of LLM Email Client

Introducing Kheish: An Open-Source Platform for Orchestrating Complex LLM Workflows

Show HN: Powerdrill – Leverage LLMs to Simplify Data Analysis

First impressions of the new Amazon Nova LLMs (via a new llm-bedrock plugin)

Show HN: Prompt Engine – Auto pick LLMs based on your prompts

Show HN: Countless.dev – A website to compare every AI model: LLMs, TTSs, STTs

Feedback for project creating conversational agents using a Finite State Machine (FSM) and LLMs

Understanding LLM's Prompt Engineering James Bond Style

How to Get LLMs to Validate Each Other?

Feedback for project creating conversational agents using a Finite State Machine (FSM) and LLMs

RedSage is a lightweight terminal-based pair programming assistant that integrates with LLMs to provide real-time coding support for developers.

Speed up your AI & LLM-integration with HTTP-Streaming

Go for APIs with NLP and LLMs: Whats the standard approach

RedSage is a terminal-based pair programming assistant that integrates with LLMs. Opensource.

PydanticAI: AI Agent framework for using Pydantic with LLMs

CntxtPY: Smarter Python Context Management for LLMs (Open Source, MIT)

Autonomous LLM-Driven Research — from Data to Human-Verifiable Research Papers

Full LLM training and evaluation toolkit

OK, I can partly explain the LLM chess weirdness now

OpenCoder: Open-Source LLM for Coding

QwQ: Alibaba's O1-like reasoning LLM

An Intuitive Explanation of Sparse Autoencoders for LLM Interpretability

More →