OpenAI launches prompt-based safety policies and gpt-oss-safeguard model to help developers build age-appropriate AI protections for teenage users. (Read More)OpenAI launches prompt-based safety policies and gpt-oss-safeguard model to help developers build age-appropriate AI protections for teenage users. (Read More)

OpenAI Releases Open-Source Teen Safety Tools for AI Developers

2026/03/25 02:42
3 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

OpenAI Releases Open-Source Teen Safety Tools for AI Developers

Luisa Crawford Mar 24, 2026 18:42

OpenAI launches prompt-based safety policies and gpt-oss-safeguard model to help developers build age-appropriate AI protections for teenage users.

OpenAI Releases Open-Source Teen Safety Tools for AI Developers

OpenAI dropped a new toolkit on March 24 aimed squarely at one of AI's thorniest problems: keeping teenage users safe without neutering the technology's usefulness. The release includes prompt-based safety policies designed to work with gpt-oss-safeguard, the company's open-weight safety model available on Hugging Face.

The policies target six risk categories that disproportionately affect younger users: graphic violent and sexual content, harmful body ideals, dangerous challenges, romantic or violent roleplay, and age-restricted goods and services. Developers can plug these prompts directly into their content moderation systems for real-time filtering or batch analysis.

Why This Matters for the AI Ecosystem

Most developers building AI applications face a frustrating gap between knowing they need teen safety measures and actually implementing them. Translating "protect kids from harmful content" into operational code requires both child development expertise and deep technical knowledge—a combination few teams possess.

"One of the biggest gaps in AI safety for teens has been the lack of clear, operational policies that developers can build from," said Robbie Torney, Head of AI & Digital Assessments at Common Sense Media, who helped shape the policies. "Many times, developers are starting from scratch."

The timing feels relevant given recent Microsoft research from February showing that single benign-sounding prompts can systematically strip safety guardrails from major language models. That vulnerability makes robust, well-tested safety policies more valuable—developers can't just wing it.

What's Actually in the Release

OpenAI structured these policies as prompts rather than hard-coded rules, which means developers can adapt them to specific use cases and iterate over time. The company worked with Common Sense Media and everyone.ai to define edge cases and refine the policy language.

Dr. Mathilde Cerioli, Chief Scientist at everyone.ai, noted that content filtering is just the starting point. Her team has already built on this work to create behavioral policies addressing risks like "exclusivity and overreliance"—the tendency of AI systems to become too central to a teen's social or emotional life.

The policies are being released through the ROOST Model Community on GitHub, explicitly inviting the developer community to translate them into other languages and extend coverage to additional risk areas.

The Limitations

OpenAI is clear these policies represent a floor, not a ceiling. The company explicitly states they don't reflect the full extent of its internal safeguards and shouldn't be treated as comprehensive teen safety solutions.

"Each application has unique risks, audiences and contexts," the release notes. Developers still need to layer these policies with product design decisions, user controls, monitoring systems, and what OpenAI calls "teen-friendly transparency."

This release builds on OpenAI's broader push for youth protection, including the Model Spec's Under-18 principles, parental controls in ChatGPT, and the Teen Safety Blueprint the company has been promoting as an industry standard. Whether competitors adopt similar open-source approaches will determine if this becomes a genuine ecosystem improvement or just an OpenAI talking point.

Image source: Shutterstock
  • openai
  • ai safety
  • teen protection
  • open source
  • gpt-oss-safeguard
Market Opportunity
Prompt Logo
Prompt Price(PROMPT)
$0.03394
$0.03394$0.03394
+1.46%
USD
Prompt (PROMPT) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

BitGo expands its presence in Europe

BitGo expands its presence in Europe

The post BitGo expands its presence in Europe appeared on BitcoinEthereumNews.com. BitGo, global leader in digital asset infrastructure, announces a significant expansion of its presence in Europe. The company, through its subsidiary BitGo Europe GmbH, has obtained an extension of the license from BaFin (German Federal Financial Supervisory Authority), allowing it to offer regulated cryptocurrency trading services directly from Frankfurt, Germany. This move marks a decisive step for the European digital asset market, offering institutional investors the opportunity to access secure, regulated cryptocurrency trading integrated with advanced custody and management services. A comprehensive offering for European institutional investors With the extension of the license according to the MiCA (Markets in Crypto-Assets) regulation, initially obtained in May 2025, BitGo Europe expands the range of services available for European investors. Now, in addition to custody, staking, and transfer of digital assets, the platform also offers a spot trading service on thousands of cryptocurrencies and stablecoins. Institutional investors can now leverage BitGo’s OTC desk and a high-performance electronic trading platform, designed to ensure fast, secure, and transparent transactions. Aggregated access to numerous liquidity sources, including leading market makers and exchanges, allows for trading at competitive prices and high-quality executions. Security and Regulation at the Core of BitGo’s Strategy According to Brett Reeves, Head of European Sales and Go Network at BitGo, the goal is clear: “We are excited to strengthen our European platform and enable our clients to operate smoothly, competitively, and securely.§By combining our institutional custody solution with high-performance trading execution, clients will be able to access deep liquidity with the peace of mind that their assets will remain in cold storage, under regulated custody and compliant with MiCA.” The security of digital assets is indeed one of the cornerstones of BitGo’s offering. All services are designed to ensure that investors’ assets remain protected in regulated cold storage, minimizing operational and counterparty risks.…
Share
BitcoinEthereumNews2025/09/18 04:28
Wormhole launches reserve tying protocol revenue to token

Wormhole launches reserve tying protocol revenue to token

The post Wormhole launches reserve tying protocol revenue to token appeared on BitcoinEthereumNews.com. Wormhole is changing how its W token works by creating a new reserve designed to hold value for the long term. Announced on Wednesday, the Wormhole Reserve will collect onchain and offchain revenues and other value generated across the protocol and its applications (including Portal) and accumulate them into W, locking the tokens within the reserve. The reserve is part of a broader update called W 2.0. Other changes include a 4% targeted base yield for tokenholders who stake and take part in governance. While staking rewards will vary, Wormhole said active users of ecosystem apps can earn boosted yields through features like Portal Earn. The team stressed that no new tokens are being minted; rewards come from existing supply and protocol revenues, keeping the cap fixed at 10 billion. Wormhole is also overhauling its token release schedule. Instead of releasing large amounts of W at once under the old “cliff” model, the network will shift to steady, bi-weekly unlocks starting October 3, 2025. The aim is to avoid sharp periods of selling pressure and create a more predictable environment for investors. Lockups for some groups, including validators and investors, will extend an additional six months, until October 2028. Core contributor tokens remain under longer contractual time locks. Wormhole launched in 2020 as a cross-chain bridge and now connects more than 40 blockchains. The W token powers governance and staking, with a capped supply of 10 billion. By redirecting fees and revenues into the new reserve, Wormhole is betting that its token can maintain value as demand for moving assets and data between chains grows. This is a developing story. This article was generated with the assistance of AI and reviewed by editor Jeffrey Albus before publication. Get the news in your inbox. Explore Blockworks newsletters: Source: https://blockworks.co/news/wormhole-launches-reserve
Share
BitcoinEthereumNews2025/09/18 01:55
SlowMist: Attackers have stolen approximately 300GB of data due to the LiteLLM vulnerability. Encryption developers are advised to conduct an immediate self-check.

SlowMist: Attackers have stolen approximately 300GB of data due to the LiteLLM vulnerability. Encryption developers are advised to conduct an immediate self-check.

PANews reported on March 25th that 23pds, Chief Information Security Officer of SlowMist Technology, issued another warning regarding the LiteLLM attack: "All cryptocurrency
Share
PANews2026/03/25 10:30