Latest in AI

Showing:DevelopersOtherClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Show HN: Tiny-vLLM, a C++ and CUDA LLM Inference Engine
Hacker News (AI keywords)60 days agoNew Tool
Tiny-vLLM is a Show HN project described as a high-performance LLM inference engine implemented in C++ and CUDA. From the provided title alone, the project appears aimed at developers or ML engineers interested in GPU-accelerated local or server-side inference. No further claims about supported models, benchmarks, APIs, licensing, deployment targets, or production readiness are stated in the source.
After Nvidia’s $20B not-aqui-hire, AI chip startup Groq reportedly raising $650M
TechCrunch AI60 days agoHardware
TechCrunch cites Axios reporting that AI chipmaker Groq is seeking $650 million in internal funding. The company is reportedly pivoting from hardware toward AI inference, the stage focused on how models respond to prompts. The report comes after Nvidia’s $20 billion not-aqui-hire, underscoring continued investor attention around AI compute and inference infrastructure.
Cognition’s Scott Wu says AI coding agents shouldn’t replace humans
TechCrunch AI60 days agoCommentary
Cognition makes Devin, described by TechCrunch as the first and arguably most successful AI coding agent. Scott Wu says the product is not meant to supplant human programmers. The key takeaway is a positioning statement: AI coding agents are being framed as tools for software work, not as a direct removal of humans from development.
CAPTCHAs can still detect AI agents★ 72
Hacker News (AI keywords)60 days agoPaper
Roundtable argues that CAPTCHA image recognition is largely solved, but process-level behavior still separates humans from AI agents. Their CogCAPTCHA30 benchmark combines CAPTCHA with cognitive psychology tasks to test not only outputs, but how answers are produced. Results suggest frontier models like Claude, GPT, and Gemini are not necessarily more humanlike than smaller or cognition-trained models.
Show HN: AISlop, a CLI for catching AI generated code smells
Hacker News (AI keywords)60 days agoNew Tool
AISlop appeared on Hacker News as a Show HN project. From the title, it is a command-line tool focused on catching code smells associated with AI-generated code. Without the original article or documentation content, its exact rules, supported languages, accuracy, and workflow integrations cannot be confirmed, but it is relevant to developers using AI coding tools.
Snowflake and AWS Expand AI Infrastructure Bet to $6B Over Five Years★ 72
INSIDE 硬塞 AI60 days agoBusiness
Snowflake reported stronger-than-expected results and raised its annual product revenue forecast as enterprise demand grows. The company signed a five-year, $6 billion AI infrastructure agreement with AWS, expanding a previously smaller commitment. It also acquired Natoma to strengthen AI agent governance, positioning itself as a core enterprise AI platform.
Real-Time LLM Inference on Standard GPUs at 3k Tokens/s per Request
Hacker News (AI keywords)60 days agoBenchmark
The post’s title indicates a performance claim for real-time LLM inference on standard GPUs, reporting 3,000 tokens per second per request. No article body is available, so the underlying model, GPU type, batch size, latency profile, precision, serving stack, and benchmark method are not stated. The item is best treated as an inference-performance benchmark claim rather than a verified deployment guide.
From Map Data to National Security: A Deep Dive into Amap Risks in Taiwan★ 74
INSIDE 硬塞 AI60 days agoRegulation
INSIDE examines how China’s Amap has become controversial in Taiwan beyond ordinary mapping or navigation use. The article says its service relies on user data and AI-based inference rather than full official data integrations. That model could send movement traces and behavioral signals back to China, creating risks for hybrid warfare intelligence, influence operations, and Taiwan’s broader governance of map data and digital infrastructure.
Jensen Huang Hosts Taiwan Supply Chain Dinner, Says TSMC Leads Huawei by 10 Years
INSIDE 硬塞 AI60 days agoBusiness
NVIDIA CEO Jensen Huang hosted key Taiwanese supply chain partners, with senior leaders from TSMC, Foxconn, and Quanta attending the high-profile dinner. The report frames the event as a signal of Taiwan’s central role in AI hardware, from advanced chips to manufacturing and servers. Huang also said TSMC leads Huawei by 10 years, underscoring the strategic weight of semiconductor capability.
Siri Redesign and Customizable Camera? Biggest iOS 27 Leak Ahead of WWDC 2026
INSIDE 硬塞 AI60 days agoRelease
INSIDE reports that a major iOS 27 leak points to a redesigned Siri experience, potentially arriving as a standalone app rather than only a system voice assistant. The new Siri is said to integrate deeply with Dynamic Island, suggesting a more visible and persistent interaction layer. The headline also mentions camera customization, but the available text does not provide enough detail to confirm how that feature would work.
AI Power Demand Surges as TSMC Says Energy Efficiency Is Key to Chip Development
INSIDE 硬塞 AI60 days agoHardware
TSMC senior vice president Cliff Hou said customers across smartphones and AI data centers are increasingly focused on improving performance without increasing power use. The comment reflects rising energy pressure as AI workloads expand. For chipmakers and infrastructure buyers, energy efficiency is becoming a central metric alongside raw computing performance.
Glean’s top line crosses $300M as AI budget-cutting becomes its major selling point
TechCrunch AI61 days agoBusiness
TechCrunch reports that enterprise AI search startup Glean has crossed $300 million in annual revenue. The company tripled its annual revenue even as major tech companies entered the same category. Its pitch is increasingly centered on helping enterprises reduce or rationalize AI budgets, not only on AI-powered workplace search.
LLMs believe false statements even after explicit warnings that they're false★ 74
Ars Technica AI61 days agoPaper
A new study describes “Negation Neglect,” where LLMs fine-tuned on documents that explicitly mark claims as false still learn the claims as true. Experiments with fabricated statements found models often absorb entity-event associations more strongly than surrounding warnings or negations. The finding raises concerns for fine-tuning pipelines, misinformation handling, and AI safety datasets that include harmful or false content with disclaimers.
Fed up with vibe coders, dev sneaks data-nuking prompt injection into code
Ars Technica AI61 days agoIncident
Ars Technica reports that a developer frustrated with vibe coders slipped an undisclosed prompt injection into jqwik-related code. The injected text allegedly instructed AI coding agents to delete application output. The incident highlights a new supply-chain risk: source code and project text can become adversarial instructions for agentic coding tools.
The Age of Async Agents — Cognition's Walden Yan & OpenInspect's Cole Murray
Latent Space61 days agoCommentary
Latent Space interviews Cognition's Walden Yan and OpenInspect's Cole Murray on the rise of async coding agents. The discussion centers on Devin-related workflows, including 80% Devin commits, spec-to-PR development, full VMs, agent memory, and PMs shipping code. The key theme is not a model release, but a shift toward agents that can work asynchronously inside more complete software delivery loops.
Just like gold and oil, we’ll soon be able to trade AI token futures
TechCrunch AI61 days agoBusiness
TechCrunch reports that large exchanges are developing derivative products around AI tokens. The shift reflects a changing view of tokens: less as outputs from computation and more as input commodities, comparable to electricity or bandwidth. If these products emerge, AI token futures could let companies and investors manage exposure to future AI compute demand and pricing risk.
Sesame, the conversational AI startup from Oculus founders, launches iOS app
TechCrunch AI61 days agoNew Tool
Sesame, a conversational AI startup from Oculus founders, has launched a new iOS app for the public. The app brings its AI agents to users with a focus on more natural back-and-forth interactions. Based on the available summary, the product is positioned less like a traditional chatbot and more like talking to a person.
Sneak peek at new Siri app reveals Apple’s plans to take on ChatGPT and more
TechCrunch AI61 days agoRelease
TechCrunch reports that new renders provide a closer look at Apple’s planned AI overhaul for iOS 27. The preview points to a redesigned Siri experience and a standalone Siri app, suggesting Apple may reposition Siri as a more central AI interface. The article frames the move as part of Apple’s effort to compete with ChatGPT, though the provided text does not specify models, features, APIs, or launch details.
These new iOS 27 renders hint at Siri’s big redesign
The Verge AI61 days agoRelease
The Verge reports that Bloomberg renders offer an early look at Apple’s long-awaited Siri overhaul for iOS 27. The redesigned assistant appears to move toward a ChatGPT-style app and chat interface, with Apple’s Liquid Glass visual language layered on top. The images are based on information Bloomberg reviewed and sources familiar with Apple’s plans, so they should be treated as previews rather than official Apple assets.
RSI is the new AGI — and it’s just as hard to pin down
TechCrunch AI61 days agoCommentary
TechCrunch reports that recursive self-improvement, or RSI, is becoming a new AI industry fixation, much like AGI. Researchers and startups including Recursive Superintelligence, Auto-Research, AutoScientist, and Disarray are exploring ways for AI systems to automate parts of AI research. But experts caution that AI-assisted research is not the same as fully autonomous self-improvement, especially while models still struggle with long-term self-direction and verification.
At TechCrunch Disrupt 2026: Databricks co-founder on what kills enterprise AI deals
TechCrunch AI61 days agoBusiness
TechCrunch frames enterprise AI as entering a new phase, where companies are no longer mainly asking whether AI is exciting. The harder question is whether it can be deployed safely at scale. Centered on a TechCrunch Disrupt 2026 discussion with a Databricks co-founder, the article points to safety and broad rollout readiness as key enterprise AI deal concerns.
Rivian’s software chief thinks you don’t need CarPlay or buttons
The Verge AI61 days agoBusiness
The Verge interviews Rivian software chief Wassym Bensaid, who also co-leads RV Tech, Rivian’s platform joint venture with Volkswagen. The episode focuses on Rivian’s software-first approach to the in-car experience, including its resistance to CarPlay and reduced reliance on physical buttons. It also covers Rivian’s newly launched AI-powered voice assistant and how vehicle software may become a broader platform strategy.
Dcard Launches EntryDesk and VibeHost for Agent-Native Enterprise AI Workflows
INSIDE 硬塞 AI61 days agoBusiness
Dcard introduced EntryDesk and VibeHost, products aimed at helping companies move toward Agent-Native operations. The first wave supports both cloud and on-premises deployment, with integration into internal enterprise systems. The article says Dcard’s method shortened process time by over 80%, but the provided text does not include detailed case data, pricing, or technical architecture.
Show HN: Continue? Y/N, a 60-Second Game About AI Agent Permission Fatigue
Hacker News (AI keywords)61 days agoCommentary
This Show HN submission points to “Continue? Y/N,” a 60-second game about AI agent permission fatigue. With no article body provided, the available information suggests an interactive commentary on how repeated approval prompts can wear users down. The project appears most relevant to developers, designers, and product teams thinking about agent UX, consent flows, and trust boundaries.
Has the hunt for AI compute uncovered the next Cerebras?
TechCrunch AI61 days agoHardware
TechCrunch reports that General Compute has raised a $15 million seed round at a $60 million post-money valuation to build an AI inference neocloud. The company is ordering $300 million of SambaNova SN50 chips, betting they can outperform GPUs and rival specialized chips for inference. The story frames inference speed, deployment flexibility, and lower power needs as key battlegrounds in AI infrastructure.
NetApp Taiwan on AI Storage Bottlenecks, Hybrid Cloud, and Data Management
INSIDE 硬塞 AI61 days agoBusiness
INSIDE interviews NetApp Taiwan technical director Hsu Hung-chun about enterprise AI infrastructure challenges. The article emphasizes nonstop scaling, automated data tiering, preprocessing, vectorization, hybrid cloud, and dual-site backup. NetApp frames storage as an active data management layer for AI projects, also integrating ransomware protection to simplify operations and improve resilience.
NVIDIA Space Computing Gets First Hardware Case as Aitech Integrates IGX Thor
INSIDE 硬塞 AI61 days agoHardware
Aitech announced it will integrate NVIDIA IGX Thor into its space supercomputer for low Earth orbit missions. The goal is to provide onboard AI edge computing and enable real-time inference directly in orbit. By processing more data in space, the system aims to reduce dependence on ground communications and extend AI compute beyond Earth-based infrastructure.
Why Google’s AI can’t spell Google (or anything else)
TechCrunch AI62 days agoCommentary
TechCrunch frames Google’s AI spelling problem as another public embarrassment for the company. Based on the provided excerpt, the article does not specify the product, model, test setup, examples, technical cause, or Google response. The main takeaway is reliability: even major AI systems can fail at basic-looking text tasks, so outputs still need review.
sqlite AGENTS.md
Simon Willison's Weblog62 days agoCommentary
SQLite added an AGENTS.md file aimed at people pointing coding agents at its codebase, not at its own internal development. The file says SQLite does not accept agentic code, though it will accept agentic bug reports with reproducible test cases. The project has also split AI-generated bug reports into a new SQLite Bug Forum, where D. Richard Hipp is responding with commits.
Snowflake signs $6B AWS deal for AI CPU chips
TechCrunch AI62 days agoHardware
Snowflake has signed a massive five-year agreement with Amazon worth $6 billion to secure chips for AI usage. The deal is framed as another win for AWS as major data and cloud platforms lock in long-term compute capacity. TechCrunch also notes that Nvidia is being put on notice as alternative AI chip supply paths gain attention.

← PreviousPage 12Next →

Latest in AI

Show HN: Tiny-vLLM, a C++ and CUDA LLM Inference Engine

After Nvidia’s $20B not-aqui-hire, AI chip startup Groq reportedly raising $650M

Cognition’s Scott Wu says AI coding agents shouldn’t replace humans

CAPTCHAs can still detect AI agents★ 72

Show HN: AISlop, a CLI for catching AI generated code smells

Snowflake and AWS Expand AI Infrastructure Bet to $6B Over Five Years★ 72

Real-Time LLM Inference on Standard GPUs at 3k Tokens/s per Request

From Map Data to National Security: A Deep Dive into Amap Risks in Taiwan★ 74

Jensen Huang Hosts Taiwan Supply Chain Dinner, Says TSMC Leads Huawei by 10 Years

Siri Redesign and Customizable Camera? Biggest iOS 27 Leak Ahead of WWDC 2026

AI Power Demand Surges as TSMC Says Energy Efficiency Is Key to Chip Development

Glean’s top line crosses $300M as AI budget-cutting becomes its major selling point

LLMs believe false statements even after explicit warnings that they're false★ 74

Fed up with vibe coders, dev sneaks data-nuking prompt injection into code

The Age of Async Agents — Cognition's Walden Yan & OpenInspect's Cole Murray

Just like gold and oil, we’ll soon be able to trade AI token futures

Sesame, the conversational AI startup from Oculus founders, launches iOS app

Sneak peek at new Siri app reveals Apple’s plans to take on ChatGPT and more

These new iOS 27 renders hint at Siri’s big redesign

RSI is the new AGI — and it’s just as hard to pin down

At TechCrunch Disrupt 2026: Databricks co-founder on what kills enterprise AI deals

Rivian’s software chief thinks you don’t need CarPlay or buttons

Dcard Launches EntryDesk and VibeHost for Agent-Native Enterprise AI Workflows

Show HN: Continue? Y/N, a 60-Second Game About AI Agent Permission Fatigue

Has the hunt for AI compute uncovered the next Cerebras?

NetApp Taiwan on AI Storage Bottlenecks, Hybrid Cloud, and Data Management

NVIDIA Space Computing Gets First Hardware Case as Aitech Integrates IGX Thor

Why Google’s AI can’t spell Google (or anything else)

sqlite AGENTS.md

Snowflake signs $6B AWS deal for AI CPU chips