Latest in AI

Showing:ResearchersClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Co-Existence and the End of Co-Intelligence
One Useful Thing (Mollick)54 days agoCommentary
Ethan Mollick’s One Useful Thing post announces or frames Co-Existence, the follow-up to Co-Intelligence. The core shift is from prompting chatbots as collaborators toward living and working alongside increasingly embedded AI systems. It is best read as commentary and book positioning, not a technical release, benchmark, or tool tutorial.
These LLMs are the best at resisting Russian propaganda
Ars Technica AI54 days agoBenchmark
Ars Technica reports on an Estonian government benchmark evaluating how large language models handle Russian propaganda. The test focuses on whether dozens of models resist, repeat, or normalize Russia’s strategic narratives. The topic matters for governments, researchers, and AI builders because LLMs are increasingly used to summarize and mediate public information.
Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs
Latent Space54 days agoBenchmark
Latent Space talks with Lukas Petersson and Axel Backlund of Andon Labs, the authors behind VendingBench. The episode focuses on evaluating Claude models across a range from Haiku to Mythos. It also discusses how they build frontier evals from scratch, with an emphasis on creating benchmarks that remain useful and meaningful over time.
Elon Musk tries again to escape FTC audits of X data handling
Ars Technica AI54 days agoRegulation
Ars Technica reports that Elon Musk is again seeking to escape FTC audits over how X handles user data. Public commenters warned the FTC that Musk cannot be trusted to protect X users’ privacy. The story centers on platform governance, privacy oversight, and whether external audits should remain in place for X’s data practices.
Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI
Hugging Face Blog54 days agoRelease
NVIDIA’s Nemotron 3.5 Content Safety is positioned as a customizable multimodal safety layer for global enterprise AI. Based on the title, it appears focused on content moderation and policy enforcement across AI applications, potentially including text and visual contexts. Without the full article, details such as benchmarks, licensing, supported languages, deployment paths, and model specifications should not be assumed.
Google Requested a Slightly Different AI Statement
Simon Willison's Weblog54 days agoEthics
Simon Willison quotes Emanuel Maiberg of 404 Media about a post-publication request from Google. After the story ran, Google asked the outlet to publish a slightly different version of its statement. The notable change: the revised statement no longer said it was critical to maintain humans in the loop, raising questions about corporate AI accountability language.
The LLM warnings Google fired Timnit Gebru over have all come true
Hacker News (AI keywords)54 days agoEthics
The post frames Timnit Gebru’s dispute with Google as an early warning about large language model risks. Based on the available title, it appears to argue that concerns around bias, accountability, concentration of power, and deployment risks have since become visible in practice. This is best read as AI ethics commentary, not a model release or technical tutorial.
Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.
TechCrunch AI54 days agoHardware
Hello Robot has released Stretch 4, the fourth generation of its home assistance robot. The company is taking a cautious, deployment-first approach, using a wheeled base, telescoping arm, sensors, and human-in-the-loop control rather than promising a general-purpose humanoid. TechCrunch frames Stretch as a practical bet on real household data, assistive use cases, and safer hardware for people with mobility challenges.
How some data center operators are tackling their water use problems
Ars Technica AI54 days agoBusiness
Ars Technica examines how hyperscalers and data center operators are facing pressure over water use. The issue centers on local water availability and quality as AI infrastructure expands. The provided excerpt says some operators are trying to address the problem, but does not specify companies, methods, or measured results.
How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent
Hugging Face Blog54 days agoTutorial
This Hugging Face Blog post appears to be a practical tutorial for fine-tuning NVIDIA Nemotron 3.5 ASR. Based on the title, it focuses on adapting speech recognition to a target language, specialized domain, or accent. The original text was not provided, so implementation details, datasets, commands, metrics, and hardware requirements cannot be confirmed.
Let us filter AI slop, you cowards
The Verge AI54 days agoCommentary
The article says AI-generated content has become nearly impossible to avoid online. Platforms such as YouTube, Instagram, and TikTok have expanded authentication efforts and increasingly label AI-made images, videos, and music. The author argues that labels are not enough: if platforms can identify AI content, they should give users controls to filter or reduce it.
EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios
Hugging Face Blog54 days agoBenchmark
ServiceNow AI published a Hugging Face Blog post titled “EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios.” Based only on the title, it appears to be a benchmark dataset update involving tool-use or scenario-based AI evaluation. The exact domains, tools, scenario design, licensing, supported models, and evaluation methodology cannot be confirmed without the full article.
AI leaders call for tougher protections against AI-aided bioweapons★ 76
The Verge AI54 days agoRegulation
Major AI rivals including leaders from Anthropic, OpenAI, Microsoft, Meta, and Google DeepMind signed an open letter urging US lawmakers to close a biosecurity gap. They want companies selling synthetic DNA and RNA to screen orders for sequences that could help create dangerous pathogens. The concern is that more capable AI tools and cheaper biology infrastructure could lower barriers to misuse.
Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining
Hugging Face Blog54 days agoTutorial
The post appears to focus on generating synthetic Q&A data from task seeds for Nemotron pretraining. Rather than a model launch, it likely emphasizes data generation and pretraining corpus design. Because the original article text is unavailable here, concrete claims about dataset scale, benchmarks, or implementation details should not be inferred.
TSMC Buys High-NA EUV but Delays Mass Production, Raises Capex to $56B★ 72
INSIDE 硬塞 AI55 days agoHardware
At TSMC’s shareholder meeting, the company said it has purchased High-NA EUV equipment but has not yet moved it into mass production due to high costs. TSMC also raised capital expenditure to $56 billion, signaling continued heavy investment in advanced manufacturing capacity. CEO C.C. Wei also pledged more than 30% annual growth in dividends and employee bonuses, while saying the company must expand its social responsibility efforts.
New solid-state battery material works from -40°C to 55°C
INSIDE 硬塞 AI55 days agoHardware
Researchers developed a solid polymer electrolyte using an in-situ polymerization process to address the tradeoff between ionic conductivity and high-voltage stability. The reported material enables lithium-metal batteries to operate from -40°C to 55°C and maintain stable cycling at 4.5V. The work suggests automotive potential, though commercial readiness, long-term durability, cost, and scale-up details were not established in the provided source.
Nemotron 3 Ultra now available on AI Gateway
Vercel Changelog55 days agoRelease
Vercel’s changelog says Nemotron 3 Ultra is now available on AI Gateway. With no source body provided, the confirmed takeaway is limited to model availability through Vercel’s gateway layer. Details such as pricing, model string, benchmarks, context length, latency, provider routing, and feature support are not available from the supplied text.
Jensen Huang Highlights Harness as a Key AI Agent Architecture Component
INSIDE 硬塞 AI55 days agoCommentary
INSIDE reports that Jensen Huang highlighted one slide as the “most important” during a multi-hour technical keynote. The slide presented the core architecture of AI agents, with Harness described as its most mysterious and critical component. The article focuses on why Harness matters in understanding agentic AI systems, while the provided source excerpt does not define it as a specific product or implementation.
Reve 2 and Ideogram 4: Layouts in Imagegen
Latent Space55 days agoRelease
Latent Space’s roundup frames image composition as a major barrier now being tackled by layout-aware image models. Reve 2.0 emphasizes precise generation and editing with layouts, while Ideogram 4.0 uses bounding boxes tied to region descriptions. The issue also covers MAI-Thinking-1, Gemma 4 12B, open audio models, agent execution layers, and model-routing cost debates.
I built a vulnerable app and spent $1,500 seeing if LLMs could hack it
Hacker News (AI keywords)55 days agoBenchmark
The author built a vulnerable React Native app with a Python backend and a Firebase access-control flaw. GPT 5.5 solved 7 of 10 runs, while Deepseek and Claude variants solved fewer attempts. Many other models failed due to refusals, API-focused tunnel vision, false positives, or inability to use the exposed Firebase path correctly.
How we contain Claude across products★ 74
Hacker News (AI keywords)55 days agoCommentary
Anthropic describes containment as the core security strategy for increasingly capable Claude agents. The post compares ephemeral containers for claude.ai, OS-level sandboxing and approvals for Claude Code, and VM isolation for Claude Cowork. It also details missed risks, including pre-trust project config execution, user-delivered prompt injection, exfiltration through approved domains, and reduced enterprise visibility inside VMs.
Designing the hf CLI as an agent-optimized way to work with the Hub
Hugging Face Blog55 days agoCommentary
Based only on the title, this Hugging Face post appears to explain how the hf CLI is being designed for AI agents working with the Hub. It likely focuses on command-line ergonomics, automation, and predictable interactions with Hub resources. Without the full text, specific features, supported agents, or implementation details should not be inferred.
Show HN: Mnemo - local-first AI memory layer for any LLM
Hacker News (AI keywords)55 days agoNew Tool
Mnemo is presented as a Show HN project that provides a local-first AI memory layer for any LLM. The title indicates it is built with Rust, SQLite, and petgraph, suggesting local storage and graph-based memory relationships. Since no article body is available, details such as API design, retrieval methods, maturity, and production readiness cannot be confirmed.
Google ordered to add clearer AI Search links and let UK publishers opt out★ 78
Ars Technica AI55 days agoRegulation
The UK CMA is requiring Google to let publishers opt out of having content used in AI Overviews, AI Mode, and related generative search features. Google must also provide clearer attribution and links in AI-generated search results. The move targets publisher concerns that AI summaries reduce referral traffic while relying on original web content.
How LLMs Actually Work
Hacker News (AI keywords)55 days agoTutorial
The article explains how modern LLMs convert text into token IDs, embeddings, and position-aware vectors before passing them through stacked transformer blocks. It covers attention, multi-head attention, KV cache, GQA, feed-forward networks, MoE, residual streams, normalization, and decoding. Its goal is educational: helping readers understand the common architecture behind many current model families and read model cards or papers more confidently.
Scaling Past Informal AI - Carina Hong, Axiom Math
Latent Space55 days agoCommentary
Latent Space interviews Carina Hong of Axiom Math on verified generation and compounding intelligence. The discussion centers on moving AI from plausible informal answers toward outputs that can be checked or proven. For builders and researchers, the theme matters because verification may become a core layer for reliable reasoning in math, software, and other high-stakes domains.
Google's Gemma 4 12B is designed to run on 16GB RAM laptops
Ars Technica AI55 days agoRelease
Google introduced Gemma 4 12B, an open model aimed at running locally on laptops with 16GB of RAM. The model uses a new encoding scheme and token prediction to improve efficiency relative to its size. Its practical importance depends on real-world benchmarks, but it could lower the barrier for private, offline, and local multimodal AI workflows.
Trump AI testing plan faces problem: DOGE gutted US security teams
Ars Technica AI55 days agoRegulation
Ars Technica reports that Trump’s administration is considering government safety tests for advanced AI models before deployment. Critics argue the plan may be short-sighted and performative because DOGE cuts have weakened the US teams best positioned to conduct serious AI security reviews. The concern is that testing without staffing, transparency, and enforcement may not prevent dangerous deployments.
No, Artificial Intelligence Is Not Conscious★ 72
Hacker News (AI keywords)55 days agoOpinion
Ted Chiang criticizes the anthropomorphic framing around Anthropic’s Claude and its constitution. He argues that LLMs are sentence-continuation systems producing fictional conversational roles, not entities with subjective experience. The essay warns that presenting chatbots as morally aware risks misleading users and shifting responsibility away from humans and companies.
How Embryos Shape Their Limbs: A Key Discovery of Genetic Brakes
Hacker News (AI keywords)55 days agoPaper
A Université de Montréal and IRCM team reports in PNAS that Polycomb complexes PRC1 and PRC2 act as genetic brakes during mouse limb development. These systems silence early developmental genes so later programs can proceed. Disrupting one system alters gene expression; disrupting both keeps early genes active and severely compromises normal limb formation.

← PreviousPage 20Next →

Latest in AI

Co-Existence and the End of Co-Intelligence

These LLMs are the best at resisting Russian propaganda

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

Elon Musk tries again to escape FTC audits of X data handling

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Google Requested a Slightly Different AI Statement

The LLM warnings Google fired Timnit Gebru over have all come true

Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.

How some data center operators are tackling their water use problems

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Let us filter AI slop, you cowards

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

AI leaders call for tougher protections against AI-aided bioweapons★ 76

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

TSMC Buys High-NA EUV but Delays Mass Production, Raises Capex to $56B★ 72

New solid-state battery material works from -40°C to 55°C

Nemotron 3 Ultra now available on AI Gateway

Jensen Huang Highlights Harness as a Key AI Agent Architecture Component

Reve 2 and Ideogram 4: Layouts in Imagegen

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it

How we contain Claude across products★ 74

Designing the hf CLI as an agent-optimized way to work with the Hub

Show HN: Mnemo - local-first AI memory layer for any LLM

Google ordered to add clearer AI Search links and let UK publishers opt out★ 78

How LLMs Actually Work

Scaling Past Informal AI - Carina Hong, Axiom Math

Google's Gemma 4 12B is designed to run on 16GB RAM laptops

Trump AI testing plan faces problem: DOGE gutted US security teams

No, Artificial Intelligence Is Not Conscious★ 72

How Embryos Shape Their Limbs: A Key Discovery of Genetic Brakes