Latest in AI

Showing:ResearchersClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Anthropic opens Milan office to support Italian enterprise, research, and developers
Anthropic News50 days agoBusiness
Anthropic announced on May 27, 2026 that it opened a Milan office focused on Italian enterprises, researchers, and developers. Based only on the title, this appears to be a regional business expansion rather than a model or product launch. The main relevance is Anthropic’s continued investment in local European presence and ecosystem support.
Anthropic raises $65B Series H at $965B post-money valuation★ 76
Anthropic News50 days agoBusiness
Anthropic announced on May 28, 2026 that it raised $65 billion in Series H funding at a $965 billion post-money valuation. The supplied source includes only the title, so investor names, use of funds, revenue details, or product implications cannot be confirmed. The news is significant as a business and funding signal for the company behind Claude, but deeper interpretation requires the full announcement.
What We Learned Mapping a Year's Worth of AI-Enabled Cyber Threats★ 74
Anthropic News50 days agoEthics
Anthropic analyzed 832 accounts banned for malicious cyber activity from March 2025 to March 2026 and mapped them to MITRE ATT&CK. The report says attackers increasingly use AI beyond preparation, applying it to post-compromise tasks such as account discovery, lateral movement, and privilege escalation. Anthropic argues that frameworks need to capture agentic orchestration, chained attack stages, real-time decisions, and low-human-intervention operations.
An update on our election safeguards
Anthropic News50 days agoEthics
Anthropic explains how Claude is being prepared for major 2026 elections, including political neutrality training, policy enforcement, abuse detection, and reliable information routing. The post reports high evaluation scores for Opus 4.7 and Sonnet 4.6 across bias, election-policy compliance, influence-operation resistance, and web-search triggering. Claude.ai will also show election banners that point users to trusted voter resources such as TurboVote.
Widening the conversation on frontier AI
Anthropic News50 days agoEthics
Anthropic says it has been holding dialogues with religious, philosophical, ethical, and cross-cultural groups about frontier AI. The work focuses on moral formation, Claude’s constitution, and what kind of character an AI system should exhibit under pressure. The company also describes an early experiment where Claude could call an ethical reminder tool during tasks, which reduced misaligned behavior in several internal evaluations.
Chris Olah's remarks on Pope Leo XIV's AI encyclical
Anthropic News50 days agoEthics
Anthropic News published the full text of co-founder Chris Olah's remarks on Pope Leo XIV's encyclical, “Magnifica humanitas.” Based on the title alone, the piece appears to be a public commentary on AI, ethics, and human values rather than a product or research announcement. The original article text was not provided, so no specific claims, positions, or policy details can be verified.
Expanding Project Glasswing
Anthropic News50 days agoBusiness
Anthropic announced an expansion of Project Glasswing on June 2, 2026. The project will extend to approximately 150 new organizations in more than fifteen countries. Based only on the provided title, this appears to be a program expansion rather than a new model, product feature, or developer tool release.
Introducing Claude Opus 4.8★ 82
Anthropic News50 days agoRelease
Anthropic introduced Claude Opus 4.8 as an upgrade over Opus 4.7, with stronger benchmark performance across coding, agentic skills, reasoning, and knowledge work. The release also adds dynamic workflows in Claude Code, effort controls in claude.ai and Cowork, and new Messages API support for system entries inside the messages array. Pricing for regular usage remains unchanged, while fast mode is now cheaper than previous models.
Thoughts on Gemma4 12B vs 26A4B: Which Is Better?
r/LocalLLaMA top day51 days agoOpinion
The post asks the LocalLLaMA community to compare Gemma4 12B and 26A4B, explicitly excluding the 31B model from discussion. The user is mainly interested in creative tasks, writing, and chatting, with coding treated as optional rather than central. No benchmarks or examples are provided, so the post is best read as a model-selection question about subjective quality and practical use.
Do agents.md files help coding agents?
Hacker News (AI keywords)51 days agoCommentary
The source only provides the title, so no conclusion or evidence can be verified. The topic appears to ask whether an agents.md file helps coding agents understand project conventions, commands, and constraints. This is relevant to developers adopting AI coding tools, but any claims about effectiveness would require the original post or supporting examples.
Google's Official Gemma 4 QAT Q4_0 GGUFs Have Higher Precision Than Unsloth's Q4_K_XL
r/LocalLLaMA top day51 days agoCommentary
An analysis of Gemma 4 QAT GGUF files reveals that Google's official 'Q4_0' releases actually employ a mixed-precision strategy. For smaller models like E2B and E4B, Google keeps critical token embeddings in Q6_K and certain projection weights in F16. This makes Google's Q4_0 files larger and more precise than Unsloth's 'Q4_K_XL' versions, which default to standard Q4_0 for almost all tensors.
SDSU Wired Its Dorms with 1,300 AI Cameras Without Telling Students
Hacker News (AI keywords)51 days agoEthics
San Diego State University reportedly deployed around 1,300 AI-enabled cameras across campus, including roughly 330 tied to student dorm areas. The controversy centers on whether students were adequately informed and whether residential common areas should be treated as ordinary surveillance zones. With no full article text provided, the strongest reading is that this is an AI governance and privacy incident, not a model or product launch story.
Gemma 4 31B FP8 Matches Claude Sonnet 4.6 Medium in Custom Benchmark★ 75
r/LocalLLaMA top day51 days agoBenchmark
A Reddit user shared benchmark results showing Google's Gemma 4 31B (FP8) performing on par with Claude Sonnet 4.6 Medium. The custom evaluation harness tested complex tasks including Neo4j Cypher queries, entity extraction, agentic tool calling, Python coding, and multi-vector retrieval synthesis. This highlights how quantized mid-sized open-source models are closing the gap with leading proprietary frontier models.
Jensen Huang Signs Korea Deals with SK Hynix, NAVER and Doosan
INSIDE 硬塞 AI51 days agoHardware
Nvidia announced partnerships with SK Hynix, NAVER and Doosan Group to bring its technology into AI data center projects in Korea. The collaboration also covers next-generation memory development, tying Nvidia more closely to Korea’s semiconductor and digital infrastructure ecosystem. The article does not specify investment size, deployment timeline or data center scale.
Best Local TTS Solution
r/LocalLLaMA top day51 days agoCommentary
A r/LocalLLaMA user says they have tested many local TTS tools, but none match ElevenLabs for expressiveness, voices, and cloning. They list moss-nano and Kokoro as the best edge-device candidates so far, with edgeTTS as a free/cloud option. The post asks for community experience connecting agents such as Hermes, openclaw, or opencode to Telegram voice notes or real-time voice conversations.
Algorithmic Monocultures in Hiring★ 78
Hacker News (AI keywords)51 days agoPaper
This study analyzes 3.4 million real applicants and 4 million applications across 156 U.S. employers. It finds position-level racial adverse impact that aggregate analysis can obscure, especially affecting Black and Asian applicants. The authors also show that reliance on a single vendor can create homogeneous outcomes and systemic rejections, calling for stronger audits, surveillance, and researcher access.
DeepSeek V4 Pro beats GPT-5.5 Pro on precision
Hacker News (AI keywords)51 days agoBenchmark
RuntimeWire compared DeepSeek V4 Pro and GPT-5.5 Pro across four fresh text tasks, with DeepSeek winning 38.0 to 33.0. The article highlights DeepSeek’s stronger handling of regex edge cases, workplace-update constraints, and exact JSON schema compliance. GPT-5.5 Pro remained capable, but lost points for avoidable deviations, extra process details, and minor structural mismatches.
A Matter Wi-Fi Light Bulb in Rust on the Raspberry Pi Pico 2 W
Hacker News (AI keywords)51 days agoHardware
This GitHub repository collects Rust Embassy examples for Raspberry Pi Pico 2 and Pico 2 W. Its Matter Wi-Fi light example uses rs-matter, BLE commissioning, and Wi-Fi connectivity so the board can appear as a standard smart bulb in Home Assistant, Apple Home, or Google Home. The project is mainly relevant to embedded Rust and smart-home developers, not AI model users.
The Open Source Community is backing OpenEnv for Agentic RL
Hugging Face Blog51 days agoCommentary
The title indicates that OpenEnv is being positioned around agentic reinforcement learning. The confirmed signal is community support from the open-source ecosystem, not specific technical claims. Without the full article, details such as contributors, features, integrations, benchmarks, or adoption status should be treated as unknown.
datasette-agent-edit 0.1a0
Simon Willison's Weblog51 days agoRelease
Simon Willison released datasette-agent-edit 0.1a0 as a base plugin for Datasette Agent. It is intended to support future plugins that edit existing text, including collaborative Markdown, large SQL queries, and SVG files. The design follows Claude’s text editor tool pattern, exposing view, str_replace, and insert primitives so other plugins can reuse a stricter editing workflow.
"Fully Hallucinated Operating System" Simulates an Entire OS via LLM Prompts
r/LocalLLaMA top day51 days agoCommentary
A popular Reddit post highlights a video demonstrating a "Fully Hallucinated Operating System" run entirely inside an LLM. By prompting the model to act as a terminal, it simulates file systems, network requests, and command execution purely through text generation. While impractical for production, this experiment showcases the impressive state-tracking and "world model" capabilities of modern LLMs.
How much do amd64 microarchitecture levels help in Go?
Hacker News (AI keywords)51 days agoBenchmark
Daniel Lemire tests Go’s GOAMD64 levels using Roaring Bitmaps on a modern Intel Xeon. v2 brings strong gains where popcnt matters, while v3 adds further speedups in dense bitmap and set-operation workloads through AVX2. v4, despite implying AVX-512 support, shows no meaningful improvement in these benchmarks, likely due to current Go compiler limitations.
Qwen 3.6 27B DeepSWE Benchmark Results Highlight Gap Between Local and Closed-Source Models
r/LocalLLaMA top day51 days agoBenchmark
A community benchmark of Qwen 3.6 27B on DeepSWE yielded a score of 1.79% (18/20th place), slightly outperforming Haiku 4.5. Run on a single RTX 6000 Blackwell GPU via vLLM with reasoning enabled, the test averaged 32 minutes and 44k output tokens per task. The author notes that while Qwen 3.6 27B represents a 'poor man's local SOTA,' the massive gap compared to frontier closed models suggests local LLMs are struggling to keep pace in complex coding.
Amazing Digital Dentures (a failed project)
Hugging Face Blog51 days agoCommentary
The post appears to discuss a project called “Amazing Digital Dentures,” explicitly framed as a failed project. Because the article body was not provided, the specific technical stack, models, tools, datasets, and reasons for failure cannot be verified. Based on the title and URL path, it may be a hackathon-style project retrospective focused on prototyping challenges and lessons learned.
Exploring 2-bit QAT: Can Ultra-Compressed Large Models Outperform 4-bit Models Half Their Size?
r/LocalLLaMA top day51 days agoCommentary
A popular Reddit thread on r/LocalLLaMA discusses the potential of 2-bit Quantization Aware Training (QAT) for large MoE models (120B to 400B). While current QAT efforts focus on 4-bit, users speculate whether a 2-bit QAT model could fit into consumer hardware (64GB/128GB RAM) and outperform a 4-bit model of half its size. This approach is proposed as a practical alternative to training ternary (1.58-bit) LLMs from scratch.
If LLMs Have Human-Like Attributes, Then So Does Age of Empires II
Hacker News (AI keywords)51 days agoPaper
The paper argues that claims about LLMs having human-like attributes, such as morality or language understanding, can be methodologically fragile. By building and training a simple neural network on Age of Empires II, the author suggests such attributes may not be empirically unique to LLMs. The key recommendation is to define explicit measurement criteria and use a null assumption of LLM non-uniqueness before drawing anthropomorphic conclusions.
Gemma-4-26B-A4B QAT Variant Performs Poorly in llama.cpp Compared to Non-QAT Version
r/LocalLLaMA top day51 days agoBenchmark
A LocalLLaMA user highlighted that the newly released QAT (Quantization-Aware Training) variant of Google's Gemma-4-26B-A4B model underperforms compared to its non-QAT predecessor. Testing via llama.cpp on a chessboard SVG generation task showed significant rendering errors in the QAT version. The non-QAT GGUF version, however, produced highly accurate results under identical settings.
Office-open-xml-viewer: Office XML document viewer rendering to HTML Canvas
Hacker News (AI keywords)51 days agoNew Tool
office-open-xml-viewer is an open-source browser viewer for Office Open XML documents, rendering DOCX, XLSX, and PPTX files to HTML Canvas. Its parsers are written in Rust and compiled to WebAssembly, while rendering uses the Canvas 2D API. The README also says the full codebase was implemented by Claude through iterative prompting, making it notable as an AI-assisted software development case.
Control 3D Avatars with Natural Language Using "Program as Weights" (programasweights)
r/LocalLLaMA top day51 days agoNew Tool
Developer Yuntian Deng introduced "programasweights," a framework that compiles plain-English descriptions into tiny, local action programs (loops, parallel tracks) to control 3D avatars. Instead of pre-defined buttons, users can command complex sequences like "wave while walking, then jump." The runtime code is open-source and runs entirely offline in the browser or via Python.
Iran Severely Damaged US Air Ops Center in Qatar Soon After War Began
Hacker News (AI keywords)51 days agoIncident
Air & Space Forces Magazine reports that multiple Iranian missiles hit the Combined Air Operations Center at Al Udeid Air Base in Qatar early in the U.S.-Iran war. The facility was reportedly not in use, no injuries were reported, and the air campaign continued from Shaw Air Force Base in South Carolina. The incident raises questions about rebuilding, hardening, dispersing, and networking forward command nodes under missile and drone threats.

← PreviousPage 17Next →

Latest in AI

Anthropic opens Milan office to support Italian enterprise, research, and developers

Anthropic raises $65B Series H at $965B post-money valuation★ 76

What We Learned Mapping a Year's Worth of AI-Enabled Cyber Threats★ 74

An update on our election safeguards

Widening the conversation on frontier AI

Chris Olah's remarks on Pope Leo XIV's AI encyclical

Expanding Project Glasswing

Introducing Claude Opus 4.8★ 82

Thoughts on Gemma4 12B vs 26A4B: Which Is Better?

Do agents.md files help coding agents?

Google's Official Gemma 4 QAT Q4_0 GGUFs Have Higher Precision Than Unsloth's Q4_K_XL

SDSU Wired Its Dorms with 1,300 AI Cameras Without Telling Students

Gemma 4 31B FP8 Matches Claude Sonnet 4.6 Medium in Custom Benchmark★ 75

Jensen Huang Signs Korea Deals with SK Hynix, NAVER and Doosan

Best Local TTS Solution

Algorithmic Monocultures in Hiring★ 78

DeepSeek V4 Pro beats GPT-5.5 Pro on precision

A Matter Wi-Fi Light Bulb in Rust on the Raspberry Pi Pico 2 W

The Open Source Community is backing OpenEnv for Agentic RL

datasette-agent-edit 0.1a0

"Fully Hallucinated Operating System" Simulates an Entire OS via LLM Prompts

How much do amd64 microarchitecture levels help in Go?

Qwen 3.6 27B DeepSWE Benchmark Results Highlight Gap Between Local and Closed-Source Models

Amazing Digital Dentures (a failed project)

Exploring 2-bit QAT: Can Ultra-Compressed Large Models Outperform 4-bit Models Half Their Size?

If LLMs Have Human-Like Attributes, Then So Does Age of Empires II

Gemma-4-26B-A4B QAT Variant Performs Poorly in llama.cpp Compared to Non-QAT Version

Office-open-xml-viewer: Office XML document viewer rendering to HTML Canvas

Control 3D Avatars with Natural Language Using "Program as Weights" (programasweights)

Iran Severely Damaged US Air Ops Center in Qatar Soon After War Began