Latest in AI

NotebookLM Upgrades Into an Agent That Proactively Conducts Research ★ 72
INSIDE 硬塞 AI · 47 days ago · Release
Google is upgrading NotebookLM from a note-focused assistant into a research agent capable of multi-step work. The updated tool can analyze across documents, search the web, and help automate broader research workflows. It can also export results into formats such as presentations and documents, making it more useful for students, researchers, educators, and content creators who need to move from source material to finished outputs.
OpenLumara Creator Challenges Reddit to Hack Its Public Agent Instance
r/LocalLLaMA top day · 47 days ago · Incident
The creator of OpenLumara posted a public challenge asking r/LocalLLaMA users to try breaking into a Discord-hosted instance of the local-model agent. They claimed common prompt-engineering attacks would not work because modules and sandboxes were heavily locked down. The post later listed several successful findings, including missing path traversal protection, an authorization-check bypass, and another undisclosed exploit pending a fix.
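For readers unfamiliar with the first finding, the kind of path traversal check reported as missing can be sketched roughly as follows. This is a generic illustration, not OpenLumara's code: `SANDBOX_ROOT` and `resolve_in_sandbox` are hypothetical names, and a real agent would likely need further checks (symlinks, permissions, per-user roots).

```python
from pathlib import Path

# Hypothetical sandbox directory; not taken from the OpenLumara project.
SANDBOX_ROOT = Path("/srv/agent/sandbox")


def resolve_in_sandbox(user_path: str, root: Path = SANDBOX_ROOT) -> Path:
    """Return the absolute path for user_path, or raise if it escapes root."""
    root = root.resolve()
    # Joining with an absolute user_path discards root, so resolve and re-check.
    candidate = (root / user_path).resolve()
    # After ".." segments are collapsed, the result must still sit inside root.
    if candidate != root and root not in candidate.parents:
        raise PermissionError(f"path escapes sandbox: {user_path!r}")
    return candidate
```

A request such as `../../etc/passwd` resolves to a location outside the root and is rejected, while `notes/a.txt` is accepted.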
Emacs Appearances in Pop Culture
Hacker News (AI keywords) · 47 days ago · Commentary
Based only on the title and metadata, this appears to be a curated or commentary-style post about Emacs references in pop culture. No article body was provided, so specific examples, interpretation, and scope cannot be verified. Its relevance is mainly cultural and historical for developers familiar with Emacs, rather than a current AI, model, or product update.
Show HN: macOS menu bar gauges for your Claude Code quota
Hacker News (AI keywords) · 48 days ago · New Tool
This Show HN post points to a GitHub project for displaying Claude Code quota in the macOS menu bar. Based only on the title, it appears to be a lightweight developer utility focused on visibility and workflow convenience. Details such as data source, refresh behavior, installation, license, and accuracy are not available from the provided content.
New to Local LLMs: Overwhelmed by Tool Choices, Model Naming, and Quantization
r/LocalLLaMA top day · 48 days ago · Tutorial
A first-time local LLM user installed Ollama on Windows with gemma4 and qwen3.6, but was quickly confused by GUI tool selection, model size tradeoffs, and cryptic quantization names like Q4_K_M and IQ4_XS. Despite owning high-end hardware (RTX 5090, 64GB DDR5, 9950X3D), the user lacks the foundational knowledge to make informed choices. The post highlights ongoing onboarding gaps in the local LLM ecosystem, where fragmented tooling and jargon-heavy documentation create steep barriers for newcomers.
Unsloth Gemma 4 QAT MTP assistant models now available
r/LocalLLaMA top day · 48 days ago · Release
An r/LocalLLaMA post notes that Unsloth’s Gemma 4 QAT MTP assistant models are now available in GGUF format. The root directories include q8_0 files named mtp-gemma-4-*.gguf, while MTP folders contain q8_0 and larger quantized variants. The listed releases cover 12B, 26B-A4B, 31B, E2B, E2B mobile, E4B, and E4B mobile it-qat-GGUF repositories.
Build a Basic AI Agent from Scratch: Long Task Planning
Hacker News (AI keywords) · 48 days ago · Tutorial
This source appears to be a tutorial about constructing a basic AI agent from scratch. Based only on the title, its focus is likely long-task planning: how an agent breaks a larger objective into steps and works through them over time. No article body was provided, so specific implementation choices, model providers, tools, code examples, or evaluation results cannot be confirmed.
Single-slot half-height PCIe V100 with NVLink appears in China
r/LocalLLaMA top day · 48 days ago · Hardware
An r/LocalLLaMA post says a Bilibili creator has shown a single-slot, half-height PCIe V100 with NVLink on a custom PCB. The card is described as 16 cm long, passively cooled by default, and capped at 75W, with another version supporting up to 300W. The 16GB model is expected to cost around or below ¥1500, and a 32GB version is reportedly planned, but the card is not yet available for purchase.
Show HN: Gravity – Interactive Solar-System Simulator from Newton to Einstein
Hacker News (AI keywords) · 48 days ago · New Tool
Gravity is an interactive, web-based solar system simulator that lets users explore celestial mechanics in their browser. It covers both classical Newtonian physics and Einstein's general relativity, allowing users to visualize and compare orbital behaviors under different gravitational models. It serves as an educational tool for physics enthusiasts and students alike.
CSS: The Unavoidable Bad Parts
Hacker News (AI keywords) · 48 days ago · Tutorial
The post frames CSS as learnable in a useful subset, but full of surprising defaults and edge cases. It covers semantic HTML, wrappers, layout, browser defaults, resets, classless CSS, selectors, box sizing, margins, flexbox, responsiveness, pixels, font sizing, line height, and word breaking. The advice is pragmatic: keep markup semantic, reset inconsistent defaults, understand layout constraints, and test readability across configurations.
Making Graphics Like it's 1993: Building the Catlantean 3D Retro Engine
Hacker News (AI keywords) · 48 days ago · Tutorial
This post kicks off a series on building Catlantean 3D, a retro engine replicating 1993 graphics technology. The author bypasses modern GPUs to implement pure CPU software rendering, fixed-point math, and 256-color palettes. It offers a fascinating look into early 3D algorithms like raycasting and affine texture mapping, serving as an educational resource for low-level graphics.
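To give a feel for the fixed-point math mentioned above, here is a minimal sketch of 16.16 fixed-point arithmetic. The 16.16 layout and the helper names are assumptions for illustration; the summary does not say which format the Catlantean engine actually uses.

```python
FRAC_BITS = 16          # assumed 16.16 layout: 16 integer bits, 16 fractional bits
ONE = 1 << FRAC_BITS    # the value 1.0 in fixed-point form


def to_fixed(x: float) -> int:
    """Convert a float to its nearest fixed-point integer representation."""
    return int(round(x * ONE))


def to_float(f: int) -> float:
    """Convert a fixed-point integer back to a float (for inspection only)."""
    return f / ONE


def fixed_mul(a: int, b: int) -> int:
    # The raw product carries 32 fractional bits; shift back down to 16.
    return (a * b) >> FRAC_BITS


def fixed_div(a: int, b: int) -> int:
    # Pre-shift the numerator so the quotient keeps 16 fractional bits.
    return (a << FRAC_BITS) // b
```

For example, `to_float(fixed_mul(to_fixed(1.5), to_fixed(2.25)))` gives `3.375` using only integer multiplication and a shift, which is the appeal on CPUs without fast floating-point hardware.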
GentleOS: A Classic Operating System with a Lovely Retro GUI
Hacker News (AI keywords) · 49 days ago · Release
GentleOS (gentleos32) is an open-source hobby operating system project on GitHub featuring a charming retro GUI. Developed by luke8086, it offers a nostalgic look at classic OS design and GUI implementation. It serves as an engaging resource for retro computing enthusiasts and low-level system developers.
Yu Ai Wei Wu Showcases Education AI Model and Learning Agent at Tencent Cloud Event
量子位 QbitAI · 49 days ago · Business
According to the title, Yu Ai Wei Wu appeared at Tencent Cloud’s AI industry application conference with a focus on education models and learning Agents. The positioning suggests an effort to apply AI more deeply to personalized learning or teaching workflows. Since the original article text was not provided, specific product features, model architecture, partnerships, and real-world results cannot be verified.
Anyone seen benchmarks comparing Gemma 4 4-bit QAT vs. 8-bit standard quants?
r/LocalLLaMA top day · 49 days ago · Benchmark
An r/LocalLLaMA user is looking for benchmarks comparing Gemma 4 4-bit QAT models, via Unsloth, against standard 8-bit non-QAT quantized models. They understand that QAT is expected to preserve much of the BF16 baseline accuracy, but want hard numbers against traditional 8-bit post-training quantization (PTQ). The post highlights scattered feedback but no clear head-to-head evaluation yet.
A llama.cpp CLI Command Builder
r/LocalLLaMA top day · 49 days ago · New Tool
An r/LocalLLaMA post introduces a llama.cpp CLI Command Builder with no accounts, email, pop-ups, cookies, or ads. It stores information locally in the browser and includes editable fields for flags and arguments found in the documentation. Users can build CLI or server commands, log run information, and compare which configurations work best for their hardware; only Linux is currently supported.
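As a rough idea of what such a builder does, the sketch below assembles a llama.cpp command line from a few fields. It is not the tool from the post: `build_llama_command` and its defaults are hypothetical, and only a handful of common flags (`-m`, `-c`, `-ngl`, `-t`, `--port`) are covered.

```python
import shlex


def build_llama_command(model_path, ctx_size=4096, gpu_layers=0,
                        threads=None, server=False, port=8080):
    """Assemble a shell-safe llama-cli or llama-server command string."""
    binary = "llama-server" if server else "llama-cli"
    args = [binary, "-m", model_path, "-c", str(ctx_size), "-ngl", str(gpu_layers)]
    if threads is not None:
        args += ["-t", str(threads)]      # CPU threads used for generation
    if server:
        args += ["--port", str(port)]     # only meaningful for llama-server
    # shlex.join quotes arguments containing spaces or shell metacharacters.
    return shlex.join(args)
```

Calling `build_llama_command("models/my model.gguf", gpu_layers=99)` yields a single string with the model path quoted, ready to paste into a Linux shell.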
Developer Runs Half-Life at 30 FPS on a 2007 Nokia N95
Hacker News (AI keywords) · 49 days ago · Hardware
A developer reportedly managed to run Half-Life at 30 FPS on a Nokia N95, a smartphone originally released in 2007. Based on the title alone, the item appears to be a retro hardware and gaming-porting story rather than an AI development. The main significance is technical novelty: demonstrating an old mobile device handling a classic PC game at a playable frame rate.
llama.cpp PR adds MTP support for Gemma-4 E2B and E4B assistants
r/LocalLLaMA top day · 49 days ago · Release
The Reddit post links to ggml-org/llama.cpp Pull Request #24282, which adds MTP support for Gemma-4 E2B and E4B assistants. The submitter frames it as useful for tiny Gemma models on phones, low-end machines, Raspberry Pi, or similarly constrained devices. The post does not include benchmarks, merge status, or setup instructions, so it should be treated as a development signal rather than a finished release.
Was BitNet a dead end? What happened to ternary LLMs?
r/LocalLLaMA top day · 49 days ago · Commentary
An r/LocalLLaMA user questions whether BitNet and ternary LLMs were a dead end after earlier promise around efficient low-bit models. The post notes that the largest ternary model appears to remain around 2B parameters. It asks why frontier open-weight AI labs are not visibly pursuing the approach, but provides no technical evidence or definitive answer.
Why Are Cells Small?
Hacker News (AI keywords) · 49 days ago · Tutorial
This essay explains why most cells remain small through two physical limits: surface-area-to-volume ratio and diffusion. As cells grow, volume rises faster than membrane area, making nutrient intake, waste removal, and energy support harder. Larger cells also slow molecular encounters, though examples like red blood cells, oocytes, organelles, and giant bacteria show how biology works around these constraints.
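The two limits can be made concrete with a short calculation. The sketch below assumes an idealized spherical cell and simple 3D diffusion (mean squared displacement of 6Dt); the diffusion coefficient is an illustrative placeholder, not a figure from the essay.

```python
import math

# Illustrative diffusion coefficient in um^2/s; an assumed value, not from the essay.
ASSUMED_D = 10.0


def surface_to_volume(radius_um: float) -> float:
    """Surface-area-to-volume ratio of a sphere, in 1/um."""
    area = 4 * math.pi * radius_um ** 2
    volume = (4 / 3) * math.pi * radius_um ** 3
    return area / volume  # algebraically equal to 3 / radius


def diffusion_time(distance_um: float, d: float = ASSUMED_D) -> float:
    """Rough time in seconds to diffuse a given distance in 3D: t = x^2 / (6 D)."""
    return distance_um ** 2 / (6 * d)


for r in (1, 10, 100):
    print(f"r = {r:>3} um: SA/V = {surface_to_volume(r):.2f} per um, "
          f"diffusion across r takes ~{diffusion_time(r):.3g} s")
```

A tenfold increase in radius cuts the surface-to-volume ratio tenfold and lengthens the diffusion time a hundredfold, which is the scaling argument the essay relies on.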
Full Reverse Engineering of the TI-84 Plus Operating System
Hacker News (AI keywords) · 49 days ago · Hardware
This Hacker News item links to an article titled “Full Reverse Engineering of the TI-84 Plus Operating System.” Based on the provided material, the reliable takeaway is that it concerns reverse engineering the OS of Texas Instruments’ TI-84 Plus graphing calculator. The original text was not provided, so specific claims about methods, findings, code, memory layout, or security implications cannot be verified here.
An Implementation of NanoQuant: A Flexible Binary Quantization Method
r/LocalLLaMA top day · 49 days ago · New Tool
An r/LocalLLaMA post presents an unofficial PyTorch implementation of NanoQuant, a 2026 post-training quantization method for dense transformers. The method factorizes weights into scaling vectors and binary matrices, then quantizes and fine-tunes blocks sequentially to reduce hardware requirements. Early Qwen3-0.6B and Qwen3-4B experiments are promising for base models, but instruct quality remains weak and highly dependent on calibration data.
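The general idea of representing weights as a scale times a binary matrix can be illustrated with the simplest possible baseline: one scale per row multiplied by the sign of each weight. This is explicitly not NanoQuant's factorization or the posted implementation, and it omits the sequential block fine-tuning entirely; the function names are hypothetical.

```python
def binarize_rows(weights):
    """Approximate each row w as scale * sign(w), using scale = mean(|w|).

    For a fixed sign pattern, mean(|w|) is the least-squares optimal scale.
    """
    scales, signs = [], []
    for row in weights:
        scales.append(sum(abs(v) for v in row) / len(row))
        signs.append([1 if v >= 0 else -1 for v in row])
    return scales, signs


def reconstruct(scales, signs):
    """Rebuild the dense approximation from per-row scales and +/-1 entries."""
    return [[s * b for b in row] for s, row in zip(scales, signs)]
```

Each weight is then stored as a single bit plus a shared scale, which is where the memory savings of binary schemes come from; the quality loss this naive version incurs is what more elaborate methods try to recover.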
GitHub for Beginners: Answers to some common questions
GitHub Blog · 49 days ago · Tutorial
GitHub Blog published a beginner-focused article that points readers to answers for common GitHub-related questions. The provided excerpt is brief and does not specify which questions or workflows are covered. Its value is mainly educational, serving as an entry point for people who are new to GitHub rather than as a product launch or technical deep dive.
NotebookLM’s Gemini 3.5 upgrade adds a cloud computer and help finding sources
The Verge AI · 49 days ago · Release
Google is rolling out broad updates to NotebookLM, its AI-powered note-taking and research app launched in 2023. The app now uses Google’s upgraded Gemini 3.5 model, which the company says should provide more accurate and reliable responses. The update also adds a cloud computer and help finding sources, expanding NotebookLM beyond source-based Q&A into a broader research assistant workflow.
Why are so many young people getting cancer? What researchers do and don't know
Hacker News (AI keywords) · 49 days ago · Commentary
Nature reports that researchers are investigating why more young people are developing cancers once associated mainly with older age. Emerging explanations exist, but the article stresses that causes are likely to differ by tumor type. The visible article metadata frames the issue as cancer, public health, and epidemiology, with many uncertainties still unresolved.
Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax ★ 72
r/LocalLLaMA top day · 49 days ago · New Tool
Luce Spark is an open-source MoE offload system for running 33B-35B A3B models on 16GB-class GPUs. It keeps frequently routed experts on GPU, stores the long tail in system RAM, and swaps cold experts through a bounded async cache. The author reports 13.3 GiB for Qwen3.6 35B-A3B and about 100 tok/s with Spark optimizations, but notes real 16GB GPU testing is still missing.
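The caching idea described above can be sketched as a bounded cache that keeps recently routed experts resident and evicts the coldest one when full. This is a simplified, synchronous stand-in using least-recently-used eviction, not Luce Spark's code: the `ExpertCache` class and the `load_from_ram` callable are hypothetical, and the real system is described as asynchronous.

```python
from collections import OrderedDict


class ExpertCache:
    """Bounded LRU cache standing in for the GPU-resident expert set."""

    def __init__(self, capacity, load_from_ram):
        self.capacity = capacity
        self.load_from_ram = load_from_ram  # hypothetical: expert_id -> weights
        self.resident = OrderedDict()       # ordered oldest -> most recently used
        self.hits = 0
        self.misses = 0

    def get(self, expert_id):
        if expert_id in self.resident:
            self.resident.move_to_end(expert_id)  # mark as most recently used
            self.hits += 1
            return self.resident[expert_id]
        # Miss: fetch from system RAM, then evict the coldest expert if over budget.
        self.misses += 1
        weights = self.load_from_ram(expert_id)
        self.resident[expert_id] = weights
        if len(self.resident) > self.capacity:
            self.resident.popitem(last=False)
        return weights
```

With a capacity of 2 and the routing sequence 0, 1, 0, 2, 1, only the third lookup is a hit; tracking `hits` and `misses` this way shows how much routing skew a given cache size actually captures.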
[3090] Gemma4 QAT + MTP quick TPS numbers
r/LocalLLaMA top day · 49 days ago · Benchmark
An r/LocalLLaMA user shared quick throughput numbers for Gemma4 QAT with MTP speculative decoding on an RTX 3090 24GB setup. They report roughly 1.2-1.8x TPS improvement, with Gemma 4 31B moving from about 40 tok/s to 70-80 tok/s. The author frames this as a rough benchmark, using 11 task categories and noting stochastic variation from temp 1.0.
Zig by Example
Hacker News (AI keywords) · 49 days ago · Tutorial
Zig by Example is a GitHub tutorial project inspired by Go by Example. It introduces Zig through annotated examples covering syntax, types, control flow, errors, pointers, comptime, generics, allocation, testing, file I/O, JSON, the build system, and C interop. This is not an AI model or product launch, but it is useful learning material for developers exploring Zig 0.14.
What was your local daily driver for coding last week?
r/LocalLLaMA top day · 49 days ago · Commentary
This r/LocalLLaMA post is a brief community poll asking users what their local coding daily driver was last week. The post asks commenters to share their favorite model and quant, but the provided text does not include poll options, results, or specific model names. Its value is mainly as a community signal for tracking local LLM coding preferences.
Co/plot: Supporting the Research Process Through Visualization
Cohere Blog · 50 days ago · New Tool
Cohere has announced "Co/plot," a tool dedicated to supporting the research process through advanced visualization. It aims to help researchers and developers better understand complex data structures, model behaviors, and research workflows. This launch highlights Cohere's expanding focus on building practical developer and researcher tools that complement their core LLM and embedding models.
Cohere "AI for Developers": A Hub for Enterprise RAG, Multilingual Models, and Agentic Tools
Cohere Blog · 50 days ago · Tutorial
Cohere's dedicated developer portal centralizes guides on leveraging their Command models, Embed, and Rerank APIs. It focuses on practical implementations of Retrieval-Augmented Generation (RAG), tool use for agents, and fine-tuning. This hub serves as a critical resource for engineers deploying production-grade, multilingual AI systems.
