Latest in AI

Showing:DevelopersOtherClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Quick note on recent QAT issues
r/LocalLLaMA top day49 days agoCommentary
The post argues that recent Google QAT quantization has several implementation problems, including token embeddings being quantized to q6k instead of using a pure mode. It also claims llama-quantize has a hardcoded parameter that mismatches some optimized groups, and that 32-block groups are misaligned. The author recommends Unsloth UD Q4_K_XL as a temporary option and says they are working on a patch.
Apple plays catch-up at WWDC
TechCrunch AI49 days agoCommentary
Apple spent much of its WWDC keynote on fixes, performance improvements, and long-requested features before unveiling an upgraded AI-powered Siri. The sequencing suggests Apple wants users to see AI as one piece of a larger software-improvement effort. TechCrunch frames the event as Apple playing catch-up, rather than leading with AI as the sole headline.
Apple bets cheaper AI will woo small developers
TechCrunch AI49 days agoBusiness
Apple is trying to make AI experimentation cheaper for smaller developers. According to TechCrunch, developers with fewer than 2 million first-time App Store downloads will have cloud API costs waived. The report frames this as a way to attract smaller teams as AI development and experimentation become increasingly expensive.
llama.cpp PR adds MTP support for Gemma-4 E2B and E4B assistants
r/LocalLLaMA top day49 days agoRelease
The Reddit post links to ggml-org/llama.cpp Pull Request #24282, which adds MTP support for Gemma-4 E2B and E4B assistants. The submitter frames it as useful for tiny Gemma models on phones, low-end machines, Raspberry Pi, or similarly constrained devices. The post does not include benchmarks, merge status, or setup instructions, so it should be treated as a development signal rather than a finished release.
Introducing FrontierCode★ 78
Hacker News (AI keywords)49 days agoBenchmark
Cognition launched FrontierCode, a coding benchmark focused on mergeability rather than only functional correctness. It evaluates correctness, tests, scope discipline, style, and repository-specific quality standards. Built with open-source maintainers and extensive quality control, it shows current frontier models still struggle: Claude Opus 4.8 scores 13.4% on the hardest Diamond subset, ahead of GPT-5.5 and Gemini 3.1 Pro.
Say hi to Siri AI: Apple announces more conversational voice assistant★ 76
Ars Technica AI49 days agoRelease
Apple announced “Siri AI,” a more conversational version of its voice assistant planned for this fall. The update is tied to a two-tier AI model overhaul powered in part by Google technology. The move signals Apple’s attempt to close the gap with modern AI assistants while preserving its system-level integration and privacy-focused positioning.
Was BitNet a dead end? What happened to ternary LLMs?
r/LocalLLaMA top day49 days agoCommentary
A r/LocalLLaMA user questions whether BitNet and ternary LLMs were a dead end after earlier promise around efficient low-bit models. The post notes that the largest ternary model appears to remain around 2B parameters. It asks why frontier open-weight AI labs are not visibly pursuing the approach, but provides no technical evidence or definitive answer.
Apple just taught your iPhone to finish your sentences, photos, and workflows
TechCrunch AI50 days agoRelease
Apple is bringing new AI-powered features to Safari, Shortcuts, and Passwords apps. The framing suggests AI will be embedded into everyday iPhone tasks, including writing, photo-related actions, and workflow automation. The provided source text does not include details on exact capabilities, device support, privacy design, or rollout timing, so the practical impact remains unclear.
Apple Core AI Framework★ 76
Hacker News (AI keywords)50 days agoRelease
Apple’s Core AI framework is positioned as a developer stack for deploying AI models directly inside apps on Apple silicon. The documentation describes Swift APIs, `.aimodel` assets, model specialization, caching, Xcode profiling, and debugging tools. It appears aimed at developers building low-latency, privacy-conscious on-device inference workflows, though the documentation is marked as preliminary beta information.
Apple will let you build workflows using AI in its new Shortcuts app
TechCrunch AI50 days agoRelease
Apple is upgrading the Shortcuts app in iOS 27 with AI-powered workflow creation. Users will be able to describe what they want in natural language, and Apple Intelligence will assemble the needed system and app actions. The feature is meant to make Shortcuts more approachable for non-technical users, with the updated app expected to roll out with iOS 27 this fall.
For the 2nd time in weeks, Microsoft packages laced with credential stealer★ 72
Ars Technica AI50 days agoIncident
Ars Technica reports a second Microsoft-package security incident in weeks, involving 73 packages laced with a credential stealer. The supplied summary says the malware runs as soon as the packages are opened by an AI agent and can self-replicate. The case highlights a growing software supply-chain risk: AI agents that inspect or operate on code may become execution triggers for malicious packages.
Apple gives Siri its own dedicated app
TechCrunch AI50 days agoNew Tool
TechCrunch reports that Siri is finally getting its own dedicated app. The provided text does not include details about features, launch timing, supported devices, or AI capabilities. The move could signal a more prominent product surface for Siri, but the available source text is too limited to confirm broader strategy or functionality.
Apple is fixing bill splitting with its new Siri in Camera feature
TechCrunch AI50 days agoRelease
Apple is working on a Siri in Camera feature aimed at simplifying bill splitting after meals. Users can point an iPhone at a restaurant bill, select what they ordered, and split the tab using Apple Cash. The provided source does not specify launch timing, regional availability, language support, or how the feature handles taxes, tips, or complex shared orders.
Siri AI
Hacker News (AI keywords)50 days agoRelease
Apple’s Apple Intelligence page presents Siri AI as a more capable assistant with natural conversations, personal context, cross-app actions, and a dedicated app. It also highlights Visual Intelligence across iPhone, iPad, Mac, and Apple Vision Pro, plus AI photo and image tools. Since the HN item provides only the title, this should be treated as a product preview rather than a technical deep dive.
Apple’s long-awaited AI Siri overhaul is finally here★ 72
TechCrunch AI50 days agoRelease
TechCrunch reports that Apple’s long-awaited AI overhaul of Siri has arrived. The idea behind the new “Siri AI” is to shift Siri beyond a voice-controlled assistant into an AI companion that can do more. The provided article text does not specify concrete features, supported devices, rollout timing, or technical details.
Full Reverse Engineering of the TI-84 Plus Operating System
Hacker News (AI keywords)50 days agoHardware
This Hacker News item links to an article titled “Full Reverse Engineering of the TI-84 Plus Operating System.” Based on the provided material, the reliable takeaway is that it concerns reverse engineering the OS of Texas Instruments’ TI-84 Plus graphing calculator. The original text was not provided, so specific claims about methods, findings, code, memory layout, or security implications cannot be verified here.
Apple announces Siri AI and its next generation of Apple Intelligence★ 72
The Verge AI50 days agoRelease
Apple revealed a new round of AI features at WWDC, centered on a smarter and more personalized Siri. The announcement comes two years after Apple first outlined Apple Intelligence and a more capable Siri that The Verge says never fully materialized. Apple describes Siri AI as an entirely new version of Siri, with stronger conversational ability and broader capabilities.
LocalLLaMA post urges users not to join SpaceX, OpenAI, Anthropic IPOs
r/LocalLLaMA top day50 days agoOpinion
A popular r/LocalLLaMA post urges local LLM supporters not to invest in IPOs tied to SpaceX, OpenAI, or Anthropic. The author argues that frontier labs drive up demand and prices for GPUs, RAM, SSDs, HDDs, and NAS hardware, making local inference harder. The post also questions AI company valuations, but its claims are mostly opinion and speculation without cited evidence.
WWDC 2026: Everything announced on Siri, iOS 27, Apple Intelligence and more
TechCrunch AI50 days agoRelease
Apple’s WWDC 2026 kicked off at Apple Park with expected announcements around Siri, iOS 27, Apple Intelligence, and developer demos. The event is notable as Tim Cook’s last WWDC as CEO before John Ternus takes over on September 1. Early updates include Liquid Glass opt-in adjustments, iOS 27 support back to iPhone 11, and claimed speed gains for Photos, AirDrop, and multitasking.
I bundled a fully local LLM inside my Unity game
r/LocalLLaMA top day50 days agoRelease
A developer shared a Unity game, Simulation Simulator, that bundles a local LLM with no internet, cloud service, or API key required. The game is a campfire chat simulator about DMT, simulation theory, and a monitor-headed friend, with five endings driven by natural AI interaction. The author sees this as a path toward richer NPCs, while noting local TTS and translation are still too slow for smooth gameplay.
Xiaomi Claims 1,000+ TPS on a 1T Model Using a Standard 8-GPU Server★ 72
r/LocalLLaMA top day50 days agoBenchmark
Xiaomi announced MiMo-V2.5-Pro-UltraSpeed with TileRT, claiming over 1,000 tokens/s decode speed on a 1-trillion-parameter MoE model. The company says it runs on a single standard 8-GPU commodity node, not wafer-scale or SRAM-heavy specialized hardware. The claimed stack combines FP4 MoE expert quantization, DFlash speculative decoding, and TileRT low-latency inference kernels, but independent validation is still needed.
OpenEnv coordination expands to HF, PyTorch, Unsloth, Modal, and more
r/LocalLLaMA top day50 days agoNew Tool
OpenEnv is a tool for creating agentic execution environments such as terminals, browsers, or other systems an agent can interact with. The project will now be coordinated by a committee including Meta-PyTorch, Reflection, Unsloth, Modal, Prime Intellect, Nvidia, Mercor, Fleet AI, and Hugging Face. The post also lists many AI organizations supporting or adopting OpenEnv, positioning it as infrastructure for open-source agent training.
Microsoft’s AI chief says superintelligence is near, but won’t take your job
The Verge AI50 days agoBusiness
The Verge interviews Microsoft AI CEO Mustafa Suleyman about the company’s approach to advanced AI, superintelligence, AGI, OpenAI, and automation. His message is that more powerful AI systems are arriving soon, but Microsoft wants them to remain human-controlled and human-serving. The piece is less a product announcement than a window into Microsoft’s strategic framing of AI progress and job disruption.
mtmd adds video input support in llama.cpp★ 72
r/LocalLLaMA top day50 days agoRelease
ggml-org/llama.cpp merged PR #24269, adding video input support to mtmd through mtmd-cli and /chat/completions, which also enables the web UI path. The implementation invokes a locally installed ffmpeg subprocess instead of bundling codec support, and currently extracts visual frames only, with no audio support yet. It was tested with Qwen3-VL-2B in CLI and Gemma 4 E4B in web UI, making local multimodal video experiments more accessible.
Gemma 4 Chat Template now has preserve thinking
r/LocalLLaMA top day50 days agoRelease
A r/LocalLLaMA post notes that Gemma 4’s chat template now has “preserve thinking.” The linked discussion points to google/gemma-4-31B-it on Hugging Face, suggesting a template-level change rather than a new model release or benchmark. The original post does not provide detailed usage notes, defaults, compatibility information, or measured effects.
The crash that vanished: control and emergence in a five-model economy
Hugging Face Blog50 days agoCommentary
With no source text provided, this can only be inferred from the title. The post appears to examine a five-model economy where a potential crash disappears under some form of control or changed system dynamics. Its likely relevance is in multi-agent or multi-model systems, where collective behavior can diverge from individual model behavior.
llama.cpp PR #24277 avoids KV cell copies in kv-cache
r/LocalLLaMA top day50 days agoRelease
ggml-org/llama.cpp merged PR #24277 by ggerganov, titled “kv-cache: avoid kv cells copies.” The Reddit post says the change improves MTP performance for Gemma-4 and was merged the previous day. It is available starting with the b9551 release, making it relevant for local inference users tracking llama.cpp performance updates.
NVIDIA and LG Group Build AI Factory for Physical AI, Mobility and Infrastructure★ 72
Hacker News (AI keywords)50 days agoHardware
NVIDIA and LG Group announced an AI factory collaboration spanning robotics, autonomous driving, data center technologies and GPU cloud services. The effort connects NVIDIA Isaac, Cosmos, DRIVE, DSX, Blackwell GPUs, NeMo and TensorRT-LLM with LG’s manufacturing, robotics, mobility and infrastructure businesses. The partnership also supports LG’s EXAONE sovereign AI model work and broader enterprise AI adoption across the group.
WWDC 2026: How to Watch and What to Expect from Apple's Big Event★ 78
The Verge AI50 days agoCommentary
Apple's annual WWDC 2026 is just around the corner, spotlighting upcoming updates for iOS, macOS, and other operating systems. The headline expectation is a massive, AI-driven overhaul for Siri, aiming to make the assistant far more capable. This guide covers how to watch the keynote live and what major announcements to prepare for.
Replies to Comments on "LLMs Are Eroding My Career"★ 75
Hacker News (AI keywords)50 days agoCommentary
The author addresses widespread feedback on their viral post about LLMs eroding the software engineering career. They counter the "just don't use it" argument by explaining how industry expectations have already shifted. The post highlights why reviewing AI-generated code is more cognitively exhausting than writing it, and warns about the long-term impact on junior developers' skill acquisition.

← PreviousPage 6Next →

Latest in AI

Quick note on recent QAT issues

Apple plays catch-up at WWDC

Apple bets cheaper AI will woo small developers

llama.cpp PR adds MTP support for Gemma-4 E2B and E4B assistants

Introducing FrontierCode★ 78

Say hi to Siri AI: Apple announces more conversational voice assistant★ 76

Was BitNet a dead end? What happened to ternary LLMs?

Apple just taught your iPhone to finish your sentences, photos, and workflows

Apple Core AI Framework★ 76

Apple will let you build workflows using AI in its new Shortcuts app

For the 2nd time in weeks, Microsoft packages laced with credential stealer★ 72

Apple gives Siri its own dedicated app

Apple is fixing bill splitting with its new Siri in Camera feature

Siri AI

Apple’s long-awaited AI Siri overhaul is finally here★ 72

Full Reverse Engineering of the TI-84 Plus Operating System

Apple announces Siri AI and its next generation of Apple Intelligence★ 72

LocalLLaMA post urges users not to join SpaceX, OpenAI, Anthropic IPOs

WWDC 2026: Everything announced on Siri, iOS 27, Apple Intelligence and more

I bundled a fully local LLM inside my Unity game

Xiaomi Claims 1,000+ TPS on a 1T Model Using a Standard 8-GPU Server★ 72

OpenEnv coordination expands to HF, PyTorch, Unsloth, Modal, and more

Microsoft’s AI chief says superintelligence is near, but won’t take your job

mtmd adds video input support in llama.cpp★ 72

Gemma 4 Chat Template now has preserve thinking

The crash that vanished: control and emergence in a five-model economy

llama.cpp PR #24277 avoids KV cell copies in kv-cache

NVIDIA and LG Group Build AI Factory for Physical AI, Mobility and Infrastructure★ 72

WWDC 2026: How to Watch and What to Expect from Apple's Big Event★ 78

Replies to Comments on "LLMs Are Eroding My Career"★ 75