Latest in AI

Showing:content-moderationResearchersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

New Bill Takes Aim at Government Pressure to Silence Lawful Online Speech
Hacker News (AI keywords)38 days agoRegulation
The Electronic Frontier Foundation covers a new legislative bill designed to limit government officials' ability to pressure online platforms into removing or restricting otherwise lawful speech. The bill targets informal coercive tactics — often called 'jawboning' — where officials push platforms to moderate content outside of formal legal channels. EFF frames this as a meaningful safeguard for free expression, with implications for platform operators, developers, and users broadly.
South Korean forums may need AI scans for uploaded images and videos
Hacker News (AI keywords)53 days agoRegulation
A Privacy Guides community post says South Korean forums and online communities may be required to scan user-uploaded images and videos with AI under telecom-related rules. The post claims operators must provide their own hardware, including costly Nvidia GPUs. The debate centers on illegal sexual imagery and CSAM prevention, but also raises concerns about prior censorship, false positives, free expression, and burdens on small domestic communities.
Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI
Hugging Face Blog53 days agoRelease
NVIDIA’s Nemotron 3.5 Content Safety is positioned as a customizable multimodal safety layer for global enterprise AI. Based on the title, it appears focused on content moderation and policy enforcement across AI applications, potentially including text and visual contexts. Without the full article, details such as benchmarks, licensing, supported languages, deployment paths, and model specifications should not be assumed.
Let us filter AI slop, you cowards
The Verge AI54 days agoCommentary
The article says AI-generated content has become nearly impossible to avoid online. Platforms such as YouTube, Instagram, and TikTok have expanded authentication efforts and increasingly label AI-made images, videos, and music. The author argues that labels are not enough: if platforms can identify AI content, they should give users controls to filter or reduce it.
Llama Guard 4 正式登陸 Hugging Face Hub：全新一代開源 AI 安全防護模型★ 75
Hugging Face Blog455 days agoRelease
Meta's safety guardrail model family has welcomed its newest member — Llama Guard 4 — which is now officially available on the Hugging Face Hub. As a…
Hugging Face 發布全新內容指南與政策，規範平台模型與數據集安全
Hugging Face Blog1,139 days agoRelease
Hugging Face, the world's largest open-source AI platform, has officially published new "Content Guidelines and Policy," aimed at establishing a safer, more…