Skip to main content

Enterprise Technical Service Pricing

Service Positioning

LaoZhang API provides enterprise-grade AI technical API integration services for enterprises and developers:
  • Service Nature: Information technology service / API integration service
  • Target Customers: Enterprise users, development teams, technical departments
  • Billing Method: Token/per-call billing, pay-as-you-go, no monthly fees

Transparent Pricing

Most model prices are transparent and public, consumption fully trackable

Complete Models, Fast Updates

Whenever manufacturers release new models, LaoZhang API is always quick to update

Enterprise Services

Top-up Plans

  • Standard
  • Enterprise
Minimum top-up $7. Large single top-ups receive up to 10% bonus credit. Enterprise Exclusive Service:
  • Volume discounts for large purchases
  • Invoice support available
  • Contract options
  • Dedicated account manager
  • Priority technical support
Contact:

Cost Estimation Reference

Common Scenario Cost Reference

**API Call Scenario Costs:**Daily Conversation (GPT-4.1-mini)
  • 1000 conversations ≈ $1
  • 30 times per day ≈ $1 per month
Long Text Processing (GPT-4o)
  • Process 100,000 words ≈ $5
  • Document processing ≈ $0.5
Code Development (Claude-4)
  • Generate 10,000 lines of code ≈ $3
  • Cursor daily use ≈ $0.5
Image Generation (GPT-4o-image)
  • Generate 100 images ≈ $1
  • Only $0.01 per image

Usage Monitoring & Management

Console Features

  • Usage Monitoring
  • Key Management
  • Cost Optimization
Real-time Dashboard
  • Daily, weekly, monthly statistics
  • Model usage distribution
  • Detailed cost queries
  • Quota warning settings
View at: api.yelinai.com/log Fine-grained Permission Control
  • Multiple key management
  • Model permission restrictions
  • Quota limit settings
  • Validity period control
Manage at: api.yelinai.com/token Technical Optimization Recommendations
  1. Use max_tokens to limit output
  2. Choose appropriate temperature
  3. Cache frequently used responses
  4. Batch process requests
  5. Use streaming output to reduce timeouts

Billing Modes

LaoZhang API supports two billing modes. When a model supports both modes simultaneously: Per-call billing takes priority over token-based billingIf a model supports both per-call and token-based billing, the system defaults to per-call billing.

API Key Settings Impact

Token-based Only

If token is set to “token-based only”, even if the model supports per-call billing, it will use token-based billing

Default Settings

Tokens support all billing modes by default, system auto-selects (per-call priority)

Per-call Billing Scenarios

The following types of models typically use per-call billing:
  • Image Generation
  • Video Generation
  • Special Models
Applicable Models:
  • sora_image series
  • flux-kontext-pro
  • DALL-E series
Billing Unit: Per image Applicable Models:
  • Video generation APIs
  • Animation production models
Billing Unit: Per video/per second Identification Method:
  • Model names with -all suffix
  • Specific functional models
Billing Unit: Per call View complete model price list: LaoZhang API Pricing

Token Billing Explanation

What are Tokens?

Understanding Tokens Quickly

Tokens are AI’s “vocabulary units”, the basic unit for billing.Simple Memory:
  • 1 Chinese character ≈ 2 tokens
  • 1 English word ≈ 1 token
  • 500 Chinese characters ≈ 1000 tokens
Real Examples:
  • “Hello World” = 2 tokens
  • A typical article (2000 words) ≈ 3000 tokens

Consumption Examples

Use CaseInput TokensOutput TokensTotalCost (gpt-4.1-mini)
Simple Q&A205070$0.00004
Article generation10020002100$0.0013
Code debugging500300800$0.00027
Long text translation5000500010000$0.0035

Prompts and Completions

In each API call, costs consist of two parts: 1 Prompt - Input Tokens All content you send to the model, including:
  • System prompts
  • User questions
  • Context information
  • Chat history (if any)
2 Completion - Output Tokens Content generated by the model, including:
  • Text responses
  • Code generation
  • Structured data
Input and output prices may differ between models. Typically, output tokens cost more than input tokens.

Model Price Reference

  • Best Value TOP5
  • Coding Specific
  • Image Generation
ModelInput PriceOutput PriceFeaturesUse Cases
gpt-4.1-mini$0.15/1M$0.6/1MUltra-fastDaily conversation
deepseek-v3$0.27/1M$1.1/1MChinese modelGeneral tasks
gemini-2.5-flash$0.15/1M$0.6/1MMultimodalImage-text
claude-3-haiku$0.25/1M$1.25/1MReliableText processing
qwen-max$0.5/1M$2/1MChinese optimizedChinese tasks
Recommendation: gpt-4.1-mini, fast, effective, and low-cost
ModelInput PriceOutput PriceCoding AbilityCursor Compatible
claude-sonnet-4$3/1M$15/1MExcellentPerfect
deepseek-coder-v3$0.27/1M$1.1/1MGreatGood
gpt-4.1$2.5/1M$10/1MGreatPerfect
qwen-coder$0.5/1M$2/1MGoodGood
Cursor User First Choice: gpt-4.1 (specially optimized)
ModelPriceSizeSpeedQuality
gpt-4o-image$0.01/image1024x10243sExcellent
flux-kontext-pro$0.04/imageMultiple5sSuperior
dall-e-3$0.04/image1024x102410sExcellent
sora-image$0.01/image1024x10242sGood
Best Value: gpt-4o-image, only a penny per image

Cost Estimation Method

1 Small Sample Testing Test with a small number of samples (5-10) 2 Check Consumption Logs View detailed token consumption for each call in console 3 Calculate Average Calculate average token count per call 4 Estimate Total Cost Average tokens × Expected calls × Model unit price

Practical Recommendations

  1. Test with cheaper models first to verify feasibility
  2. Analyze actual consumption through backend logs
  3. Choose appropriate models based on task complexity
  4. Optimize prompts to reduce unnecessary token consumption

Real-time Price Query

Model Price List

View real-time prices for all models
  • Token-based billing prices
  • Per-call billing prices
  • Discount comparisons

Cost Calculator (In Development)

Quickly estimate usage costs
  • Enter estimated usage
  • Select models to use
  • Auto-calculate costs

FAQ

How to check real-time prices? Log in to LaoZhang API Console, you can view real-time prices for all models on the model list page. How long for top-up to arrive? Top-up arrives instantly, available immediately after successful payment. Do you support invoices? Yes, we support issuing official invoices. Please submit an invoice application in the console. Are there bulk purchase discounts? Enterprise large top-ups can enjoy more discounts. Please contact customer service for consultation.

Enterprise Services & Invoicing

Service Nature

  • Service Type: Information technology service / Technical service
  • Invoice Category: Information technology service fee or Data collection fee
  • Target Customers: Enterprise users, development teams, technical departments

Invoice Process

After customer online top-up or corporate transfer is successful, invoices are issued based on the actual payment amount (all prices include tax). Customers need to provide detailed invoice information: enterprise or university name, tax ID, etc. 1 Submit Invoice Information Website backend top navigation - Invoice, self-service invoice form submissionSubmit Invoice Application → 2 Invoice Type
  • VAT general invoices available (special invoice tax rate needs separate discussion)
  • Invoice category: Information Technology Service Fee or Data Collection Fee
  • Can cooperate to issue: Purchase list with company seal
3 Delivery Time About 1 business day, electronic invoice sent via email This service issues invoices under the nature of technical services, meeting enterprise financial and technical procurement requirements.

Get Started

Register an account, experience 200+ AI model API integration services