AI Energy Score v2: Refreshed Leaderboard, now with Reasoning

DeepSeek-v3.2: Pushing the frontier of open large language models [pdf]

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

DeepSeek-v3.2

HunyuanOCR by Tencent: A 1B Parameter End to End OCR Expert VLM

Hugging Face, the company that hosts open source AI models from organizations like Meta and OpenAI, is currently hosting the Epstein files to support AI-driven investigative journalism.

The Epstein Files were hosted on the popular AI model sharing hub Hugging Face. Over the last week, it became the most downloaded dataset on the platform.

Drax: Speech Recognition with Discrete Flow Matching

The Smol Training Playbook: The Secrets to Building World-Class LLMs

Show HN: The Legal Embedding Benchmark (MLEB)

Show HN: Chonky – a neural text semantic chunking goes multilingual

Sentence Transformers is joining Hugging Face

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Compact VLM

Open source speech foundation model that runs locally on CPU in real-time

You can now use Google Maps as an AI assistant to find places, reviews, etc., for free

Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS

I built a Universal File Converter with Gradio & Hugging Face, looking for feedback 🚀

Wan2.2-S2V-14B – audio-driven cinematic video generation model

grok-2 on Hugging Face

Qwen Image

LFM2 WebGPU

Qwen3-4B-Thinking-2507

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

Implemented the research paper “Memorizing Transformers” from scratch with my own additional modifications to the architecture and a customized training pipeline.

Beyond Python: AI Agents in JavaScript with KaibanJS

LLM Embeddings Explained: A Visual and Intuitive Guide

Qwen3-Coder-30B-A3B-Instruct

Qwen3 235B beats Claude on some code benchmarks

Qwen3 30B-A3B
