Core42 Deploys OpenAI GPT-OSS Models for Sovereign AI Access

This integration supports real-time inference speeds of up to 3,000 tokens per second per user, enabling low-latency AI applications to run efficiently at global scale.

Reading Time: 2 Min 

Topics

  • [Image source: Chetan Jha/MITSMR Middle East]

    Sovereign cloud infrastructure that balances performance with regulatory compliance is increasingly critical for organizations in sectors like healthcare, finance, and national security.

    To meet this growing demand, Core42, a G42 company specializing in sovereign cloud and AI infrastructure has announced the availability of OpenAI’s latest open-weight models, gpt-oss-20B and gpt-oss-120B, on its AI Cloud platform. These models are now accessible through the Core42 Compass API, offering organizations immediate access to high-performance, sovereign-ready AI capabilities.

    The deployment allows enterprises, developers, and researchers to run advanced language models across a range of high-performance silicon platforms, aligning with Core42’s commitment to flexible, secure, and scalable AI infrastructure. The integration supports real-time inference speeds of up to 3,000 tokens per second per user, enabling demanding workloads and low-latency AI applications to run efficiently at global scale.

    “Core42 AI Cloud, powered by silicon-diverse infrastructure, delivers the flexibility and performance needed for today’s AI workloads,” said Kiril Evtimov, CEO of Core42 and Group CTO at G42. “Through the Compass API, organizations can access the latest open-weight AI models and choose the optimal platform to scale transformation, optimize performance and cost, and drive progress across global markets.”

    The announcement emphasizes enterprise autonomy in model deployment and customization, with benefits such as:

    • Enterprise-scale performance for complex automation and decision-making workloads
    • Sovereign-ready scalability, ensuring in-country deployment with full data and infrastructure control
    • Predictable cost-performance alignment for regulated sectors and committed infrastructure environments
    • Cost-effective agentic AI, allowing organizations to run intelligent agents efficiently and securely

    This initiative aligns with Core42’s broader vision to build sovereign, transparent AI systems. It also supports G42’s expanding AI footprint, following milestones such as the launch of the 1GW Stargate UAE facility and Microsoft’s $1.5 billion strategic investment which positions the UAE as a growing global hub for AI innovation.

    Topics

    More Like This

    You must to post a comment.

    First time here? : Comment on articles and get access to many more articles.