Latest in AI

Showing:StudentsLlamaClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

A llama.cpp CLI Command Builder
r/LocalLLaMA top day49 days agoNew Tool
A r/LocalLLaMA post introduces a llama.cpp CLI Command Builder with no accounts, email, pop-ups, cookies, or ads. It stores information locally in the browser and includes editable fields for flags and arguments found in the documentation. Users can build CLI or server commands, log run information, and compare which configurations work best for their hardware; only Linux is currently supported.
Arithmetic Without Numbers: How LLMs Do Math
Hacker News (AI keywords)53 days agoCommentary
The article asks whether LLM arithmetic is memorization, heuristics, real computation, or experimental assistance. It summarizes Rune experiments that decode operations and operands from frozen Llama activations, then route them to Python under a no-parser rule. The strongest supported claim is narrow: activation-derived tool arguments worked in scoped audits, while residual-state JIT replacement, long-number generation, and cross-model transfer remain brittle.
How LLMs Actually Work
Hacker News (AI keywords)54 days agoTutorial
The article explains how modern LLMs convert text into token IDs, embeddings, and position-aware vectors before passing them through stacked transformer blocks. It covers attention, multi-head attention, KV cache, GQA, feed-forward networks, MoE, residual streams, normalization, and decoding. Its goal is educational: helping readers understand the common architecture behind many current model families and read model cards or papers more confidently.
免費訓練 AI 模型！Hugging Face 聯手 Unsloth 推出 Hugging Face Jobs 免費微調服務★ 85
Hugging Face Blog158 days agoNew Tool
Hugging Face's official blog has announced exciting news for the open-source AI community: Hugging Face has formed a deep partnership with Unsloth — the…
Hugging Face 經典 NLP 課程正式轉型為 LLM 課程：迎向大語言模型時代的全面升級★ 85
Hugging Face Blog481 days agoTutorial
Hugging Face's "NLP Course" has long been a must-read classic for developers and researchers worldwide looking to enter the fields of Transformers and natural…
Open-R1：Hugging Face 推出完全開源的 DeepSeek-R1 重現計劃★ 90
Hugging Face Blog546 days agoRelease
### Project Background: Recreating the Open-Source Miracle of DeepSeek-R1 The emergence of DeepSeek-R1 sent shockwaves through the global AI community…
你也能設計出最先進的 Transformer 位置編碼：從直覺到 RoPE 的數學推導★ 75
Hugging Face Blog610 days agoTutorial
This educational article from Hugging Face aims to guide readers — in the most intuitive, step-by-step way — to "reinvent" RoPE (Rotary Position Embedding)…
非工程師指南：如何訓練專屬的 LLaMA 2 聊天機器人
Hugging Face Blog1,034 days agoTutorial
This official guide from Hugging Face is designed for readers without a technical background. It provides a detailed walkthrough of how to use Hugging Face's…