Back to Blog
Vitalii

First open-source model GPT-OSS by OpenAI: AI pricing and API availability

GPT-OSS is the first open-weight AI model from OpenAI, with high performance capabilities and reasoning. The models aim to provide enterprise-grade AI functionality that can run efficiently on consumer hardware or single GPUs, making top AI industry expertise from the OpenAI Team accessible to developers, researchers, and organizations of all sizes.

Open Source Benefits

  • License: Apache 2.0 - fully commercial-friendly
  • Cost: $0 for model weights and local inference
  • Customization: Full model modification and fine-tuning rights
  • Distribution: Can be redistributed and integrated into commercial products

Pricing & Availability

Web Interface

OpenAI provides a dedicated web interface where you can test GPT-OSS models at no cost. The platform offers immediate access to both 20B and 120B models without requiring API keys or technical setup.

Features:

  • Free access to all GPT-OSS model variants
  • No registration or API keys required
  • Interactive chat interface with adjustable parameters
  • Real-time model comparison capabilities

Try it now: gpt-oss.com

Cloud Inference Pricing (API)

GPT-OSS 20B Pricing

Provider Inference Context Input Price Output Price Throughput
OpenRouter Fireworks 131K $0.05 $0.20 456.0tps
OpenRouter NovitaAI 131K $0.05 $0.20 268.8tps
OpenRouter Groq 131K $0.10 $0.50 10,850tps
Cloudflare AI @cf/openai/gpt-oss-20b - $0.20 $0.30 -

GPT-OSS 120B Pricing

Provider Inference Context Input Price Output Price Throughput
OpenRouter Baseten 131K $0.10 $0.50 997.4tps
OpenRouter NovitaAI 131K $0.10 $0.50 95.02tps
OpenRouter Fireworks 131K $0.15 $0.60 233.3tps
OpenRouter Together 131K $0.15 $0.60 177.7tps
OpenRouter Parasail 131K $0.15 $0.60 102.9tps
OpenRouter Groq 131K $0.15 $0.75 954.8tps
OpenRouter Cerebras 131K $0.25 $0.69 3,512tps
Cloudflare AI @cf/openai/gpt-oss-120b - $0.35 $0.75 -

Use Cases

1. Enterprise AI on Private Infrastructure

  • Complete data privacy, no API costs, full control
  • Deploy 120B model on single H100 GPU
  • Enterprise-grade reasoning with zero data leaving premises

2. Local Development and Prototyping

  • API rate limits, offline development, cost predictability
  • 20B model running on 16GB consumer hardware
  • Rapid prototyping with production-ready AI capabilities

References:

  1. Introducing GPT-OSS - OpenAI Official Announcement
  2. Welcome OpenAI GPT-OSS - Hugging Face Technical Deep Dive
  3. GPT-OSS 20B on OpenRouter - Pricing and Specifications
  4. GPT-OSS Web Interface - Free Web Interface
  5. GPT-OSS on Ollama - Local Installation

Tags: #OpenAI #GPT-OSS #OpenSource #AI #MoE #Reasoning #LocalAI #Enterprise