| ACTIONS_ID_TOKEN_REQUEST_TOKEN | Token for requesting an ID token in GitHub Actions | 
| ACTIONS_ID_TOKEN_REQUEST_URL | URL for requesting ID token in GitHub Actions | 
| AGENTOPS_ENVIRONMENT | Environment for AgentOps logging integration | 
| AGENTOPS_API_KEY | API Key for AgentOps logging integration | 
| AGENTOPS_SERVICE_NAME | Service Name for AgentOps logging integration | 
| AISPEND_ACCOUNT_ID | Account ID for AI Spend | 
| AISPEND_API_KEY | API Key for AI Spend | 
| AIOHTTP_CONNECTOR_LIMIT | Connection limit for aiohttp connector. When set to 0, no limit is applied. Default is 0 | 
| AIOHTTP_KEEPALIVE_TIMEOUT | Keep-alive timeout for aiohttp connections in seconds. Default is 120 | 
| AIOHTTP_TRUST_ENV | Flag to enable aiohttp trust environment. When this is set to True, aiohttp will respect HTTP(S)_PROXY env vars. Default is False | 
| AIOHTTP_TTL_DNS_CACHE | DNS cache time-to-live for aiohttp in seconds. Default is 300 | 
| ALLOWED_EMAIL_DOMAINS | List of email domains allowed for access | 
| ARIZE_API_KEY | API key for Arize platform integration | 
| ARIZE_SPACE_KEY | Space key for Arize platform | 
| ARGILLA_BATCH_SIZE | Batch size for Argilla logging | 
| ARGILLA_API_KEY | API key for Argilla platform | 
| ARGILLA_SAMPLING_RATE | Sampling rate for Argilla logging | 
| ARGILLA_DATASET_NAME | Dataset name for Argilla logging | 
| ARGILLA_BASE_URL | Base URL for Argilla service | 
| ATHINA_API_KEY | API key for Athina service | 
| ATHINA_BASE_URL | Base URL for Athina service (defaults to https://log.athina.ai) | 
| AUTH_STRATEGY | Strategy used for authentication (e.g., OAuth, API key) | 
| ANTHROPIC_API_KEY | API key for Anthropic service | 
| ANTHROPIC_API_BASE | Base URL for Anthropic API. Default is https://api.anthropic.com | 
| AWS_ACCESS_KEY_ID | Access Key ID for AWS services | 
| AWS_BATCH_ROLE_ARN | ARN of the AWS IAM role for batch operations | 
| AWS_DEFAULT_REGION | Default AWS region for service interactions when AWS_REGION is not set | 
| AWS_PROFILE_NAME | AWS CLI profile name to be used | 
| AWS_REGION | AWS region for service interactions (takes precedence over AWS_DEFAULT_REGION) | 
| AWS_REGION_NAME | Default AWS region for service interactions | 
| AWS_ROLE_ARN | ARN of the AWS IAM role to assume for authentication | 
| AWS_ROLE_NAME | Role name for AWS IAM usage | 
| AWS_S3_BUCKET_NAME | Name of the AWS S3 bucket for file operations | 
| AWS_S3_OUTPUT_BUCKET_NAME | Name of the AWS S3 output bucket for batch operations | 
| AWS_SECRET_ACCESS_KEY | Secret Access Key for AWS services | 
| AWS_SESSION_NAME | Name for AWS session | 
| AWS_WEB_IDENTITY_TOKEN | Web identity token for AWS | 
| AWS_WEB_IDENTITY_TOKEN_FILE | Path to file containing web identity token for AWS | 
| AZURE_API_VERSION | Version of the Azure API being used | 
| AZURE_AUTHORITY_HOST | Azure authority host URL | 
| AZURE_CERTIFICATE_PASSWORD | Password for Azure OpenAI certificate | 
| AZURE_CLIENT_ID | Client ID for Azure services | 
| AZURE_CLIENT_SECRET | Client secret for Azure services | 
| AZURE_CODE_INTERPRETER_COST_PER_SESSION | Cost per session for Azure Code Interpreter service | 
| AZURE_COMPUTER_USE_INPUT_COST_PER_1K_TOKENS | Input cost per 1K tokens for Azure Computer Use service | 
| AZURE_COMPUTER_USE_OUTPUT_COST_PER_1K_TOKENS | Output cost per 1K tokens for Azure Computer Use service | 
| AZURE_DEFAULT_RESPONSES_API_VERSION | Version of the Azure Default Responses API being used. Default is "preview" | 
| AZURE_TENANT_ID | Tenant ID for Azure Active Directory | 
| AZURE_USERNAME | Username for Azure services. Use together with AZURE_PASSWORD to obtain an Azure AD token via the basic username/password workflow | 
| AZURE_PASSWORD | Password for Azure services. Use together with AZURE_USERNAME to obtain an Azure AD token via the basic username/password workflow | 
| AZURE_FEDERATED_TOKEN_FILE | File path to Azure federated token | 
| AZURE_FILE_SEARCH_COST_PER_GB_PER_DAY | Cost per GB per day for Azure File Search service | 
| AZURE_SCOPE | For EntraID Auth, Scope for Azure services, defaults to "https://cognitiveservices.azure.com/.default" | 
| AZURE_KEY_VAULT_URI | URI for Azure Key Vault | 
| AZURE_OPERATION_POLLING_TIMEOUT | Timeout in seconds for Azure operation polling | 
| AZURE_STORAGE_ACCOUNT_KEY | The Azure Storage Account Key to use for Authentication to Azure Blob Storage logging | 
| AZURE_STORAGE_ACCOUNT_NAME | Name of the Azure Storage Account to use for logging to Azure Blob Storage | 
| AZURE_STORAGE_FILE_SYSTEM | Name of the Azure Storage File System to use for logging to Azure Blob Storage (typically the container name) | 
| AZURE_STORAGE_TENANT_ID | The Application Tenant ID to use for Authentication to Azure Blob Storage logging | 
| AZURE_STORAGE_CLIENT_ID | The Application Client ID to use for Authentication to Azure Blob Storage logging | 
| AZURE_STORAGE_CLIENT_SECRET | The Application Client Secret to use for Authentication to Azure Blob Storage logging | 
| AZURE_VECTOR_STORE_COST_PER_GB_PER_DAY | Cost per GB per day for Azure Vector Store service | 
| BATCH_STATUS_POLL_INTERVAL_SECONDS | Interval in seconds for polling batch status. Default is 3600 (1 hour) | 
| BATCH_STATUS_POLL_MAX_ATTEMPTS | Maximum number of attempts for polling batch status. Default is 24 (for 24 hours) | 
| BEDROCK_MAX_POLICY_SIZE | Maximum size for Bedrock policy. Default is 75 | 
| BERRISPEND_ACCOUNT_ID | Account ID for BerriSpend service | 
| BRAINTRUST_API_KEY | API key for Braintrust integration | 
| BRAINTRUST_API_BASE | Base URL for Braintrust API. Default is https://api.braintrustdata.com/v1 | 
| CACHED_STREAMING_CHUNK_DELAY | Delay in seconds for cached streaming chunks. Default is 0.02 | 
| CIRCLE_OIDC_TOKEN | OpenID Connect token for CircleCI | 
| CIRCLE_OIDC_TOKEN_V2 | Version 2 of the OpenID Connect token for CircleCI | 
| CLOUDZERO_API_KEY | CloudZero API key for authentication | 
| CLOUDZERO_CONNECTION_ID | CloudZero connection ID for data submission | 
| CLOUDZERO_EXPORT_INTERVAL_MINUTES | Interval in minutes for CloudZero data export operations | 
| CLOUDZERO_MAX_FETCHED_DATA_RECORDS | Maximum number of data records to fetch from CloudZero | 
| CLOUDZERO_TIMEZONE | Timezone for date handling (default: UTC) | 
| CONFIG_FILE_PATH | File path for configuration file | 
| CUSTOM_TIKTOKEN_CACHE_DIR | Custom directory for Tiktoken cache | 
| CONFIDENT_API_KEY | API key for Confident AI (Deepeval) Logging service | 
| COHERE_API_BASE | Base URL for Cohere API. Default is https://api.cohere.com | 
| DATABASE_HOST | Hostname for the database server | 
| DATABASE_NAME | Name of the database | 
| DATABASE_PASSWORD | Password for the database user | 
| DATABASE_PORT | Port number for database connection | 
| DATABASE_SCHEMA | Schema name used in the database | 
| DATABASE_URL | Connection URL for the database | 
| DATABASE_USER | Username for database connection | 
| DATABASE_USERNAME | Alias for database user | 
| DATABRICKS_API_BASE | Base URL for Databricks API | 
| DAYS_IN_A_MONTH | Days in a month for calculation purposes. Default is 28 | 
| DAYS_IN_A_WEEK | Days in a week for calculation purposes. Default is 7 | 
| DAYS_IN_A_YEAR | Days in a year for calculation purposes. Default is 365 | 
| DYNAMOAI_API_KEY | API key for DynamoAI Guardrails service | 
| DYNAMOAI_API_BASE | Base URL for DynamoAI API. Default is https://api.dynamo.ai | 
| DYNAMOAI_MODEL_ID | Model ID for DynamoAI tracking/logging purposes | 
| DYNAMOAI_POLICY_IDS | Comma-separated list of DynamoAI policy IDs to apply | 
| DD_BASE_URL | Base URL for Datadog integration | 
| DATADOG_BASE_URL | (Alternative to DD_BASE_URL) Base URL for Datadog integration | 
| _DATADOG_BASE_URL | (Alternative to DD_BASE_URL) Base URL for Datadog integration | 
| DD_API_KEY | API key for Datadog integration | 
| DD_SITE | Site URL for Datadog (e.g., datadoghq.com) | 
| DD_SOURCE | Source identifier for Datadog logs | 
| DD_TRACER_STREAMING_CHUNK_YIELD_RESOURCE | Resource name for Datadog tracing of streaming chunk yields. Default is "streaming.chunk.yield" | 
| DD_ENV | Environment identifier for Datadog logs. Only supported for the `datadog_llm_observability` callback | 
| DD_SERVICE | Service identifier for Datadog logs. Defaults to "litellm-server" | 
| DD_VERSION | Version identifier for Datadog logs. Defaults to "unknown" | 
| DEBUG_OTEL | Enable debug mode for OpenTelemetry | 
| DEFAULT_ALLOWED_FAILS | Maximum failures allowed before cooling down a model. Default is 3 | 
| DEFAULT_ANTHROPIC_CHAT_MAX_TOKENS | Default maximum tokens for Anthropic chat completions. Default is 4096 | 
| DEFAULT_BATCH_SIZE | Default batch size for operations. Default is 512 | 
| DEFAULT_CLIENT_DISCONNECT_CHECK_TIMEOUT_SECONDS | Timeout in seconds for checking client disconnection. Default is 1 | 
| DEFAULT_COOLDOWN_TIME_SECONDS | Duration in seconds to cooldown a model after failures. Default is 5 | 
| DEFAULT_CRON_JOB_LOCK_TTL_SECONDS | Time-to-live for cron job locks in seconds. Default is 60 (1 minute) | 
| DEFAULT_DATAFORSEO_LOCATION_CODE | Default location code for DataForSEO search API. Default is 2250 (France) | 
| DEFAULT_FAILURE_THRESHOLD_PERCENT | Threshold percentage of failures to cool down a deployment. Default is 0.5 (50%) | 
| DEFAULT_FLUSH_INTERVAL_SECONDS | Default interval in seconds for flushing operations. Default is 5 | 
| DEFAULT_HEALTH_CHECK_INTERVAL | Default interval in seconds for health checks. Default is 300 (5 minutes) | 
| DEFAULT_IMAGE_HEIGHT | Default height for images. Default is 300 | 
| DEFAULT_IMAGE_TOKEN_COUNT | Default token count for images. Default is 250 | 
| DEFAULT_IMAGE_WIDTH | Default width for images. Default is 300 | 
| DEFAULT_IN_MEMORY_TTL | Default time-to-live for in-memory cache in seconds. Default is 5 | 
| DEFAULT_MANAGEMENT_OBJECT_IN_MEMORY_CACHE_TTL | Default time-to-live in seconds for management objects (User, Team, Key, Organization) in memory cache. Default is 60 seconds. | 
| DEFAULT_MAX_LRU_CACHE_SIZE | Default maximum size for LRU cache. Default is 16 | 
| DEFAULT_MAX_RECURSE_DEPTH | Default maximum recursion depth. Default is 100 | 
| DEFAULT_MAX_RECURSE_DEPTH_SENSITIVE_DATA_MASKER | Default maximum recursion depth for sensitive data masker. Default is 10 | 
| DEFAULT_MAX_RETRIES | Default maximum retry attempts. Default is 2 | 
| DEFAULT_MAX_TOKENS | Default maximum tokens for LLM calls. Default is 4096 | 
| DEFAULT_MAX_TOKENS_FOR_TRITON | Default maximum tokens for Triton models. Default is 2000 | 
| DEFAULT_MAX_REDIS_BATCH_CACHE_SIZE | Default maximum size for redis batch cache. Default is 1000 | 
| DEFAULT_MOCK_RESPONSE_COMPLETION_TOKEN_COUNT | Default token count for mock response completions. Default is 20 | 
| DEFAULT_MOCK_RESPONSE_PROMPT_TOKEN_COUNT | Default token count for mock response prompts. Default is 10 | 
| DEFAULT_MODEL_CREATED_AT_TIME | Default creation timestamp for models. Default is 1677610602 | 
| DEFAULT_NUM_WORKERS_LITELLM_PROXY | Default number of workers for LiteLLM proxy. Default is 4. We strongly recommend setting the number of workers to the number of vCPUs available | 
| DEFAULT_PROMPT_INJECTION_SIMILARITY_THRESHOLD | Default threshold for prompt injection similarity. Default is 0.7 | 
| DEFAULT_POLLING_INTERVAL | Default polling interval for schedulers in seconds. Default is 0.03 | 
| DEFAULT_REASONING_EFFORT_DISABLE_THINKING_BUDGET | Default reasoning effort disable thinking budget. Default is 0 | 
| DEFAULT_REASONING_EFFORT_HIGH_THINKING_BUDGET | Default high reasoning effort thinking budget. Default is 4096 | 
| DEFAULT_REASONING_EFFORT_LOW_THINKING_BUDGET | Default low reasoning effort thinking budget. Default is 1024 | 
| DEFAULT_REASONING_EFFORT_MEDIUM_THINKING_BUDGET | Default medium reasoning effort thinking budget. Default is 2048 | 
| DEFAULT_REASONING_EFFORT_MINIMAL_THINKING_BUDGET | Default minimal reasoning effort thinking budget. Default is 512 | 
| DEFAULT_REASONING_EFFORT_MINIMAL_THINKING_BUDGET_GEMINI_2_5_FLASH | Default minimal reasoning effort thinking budget for Gemini 2.5 Flash. Default is 512 | 
| DEFAULT_REASONING_EFFORT_MINIMAL_THINKING_BUDGET_GEMINI_2_5_FLASH_LITE | Default minimal reasoning effort thinking budget for Gemini 2.5 Flash Lite. Default is 512 | 
| DEFAULT_REASONING_EFFORT_MINIMAL_THINKING_BUDGET_GEMINI_2_5_PRO | Default minimal reasoning effort thinking budget for Gemini 2.5 Pro. Default is 512 | 
| DEFAULT_REDIS_SYNC_INTERVAL | Default Redis synchronization interval in seconds. Default is 1 | 
| DEFAULT_REPLICATE_GPU_PRICE_PER_SECOND | Default price per second for Replicate GPU. Default is 0.001400 | 
| DEFAULT_REPLICATE_POLLING_DELAY_SECONDS | Default delay in seconds for Replicate polling. Default is 1 | 
| DEFAULT_REPLICATE_POLLING_RETRIES | Default number of retries for Replicate polling. Default is 5 | 
| DEFAULT_SQS_BATCH_SIZE | Default batch size for SQS logging. Default is 512 | 
| DEFAULT_SQS_FLUSH_INTERVAL_SECONDS | Default flush interval for SQS logging. Default is 10 | 
| DEFAULT_S3_BATCH_SIZE | Default batch size for S3 logging. Default is 512 | 
| DEFAULT_S3_FLUSH_INTERVAL_SECONDS | Default flush interval for S3 logging. Default is 10 | 
| DEFAULT_SLACK_ALERTING_THRESHOLD | Default threshold for Slack alerting. Default is 300 | 
| DEFAULT_SOFT_BUDGET | Default soft budget for LiteLLM proxy keys. Default is 50.0 | 
| DEFAULT_TRIM_RATIO | Default ratio of tokens to trim from prompt end. Default is 0.75 | 
| DIRECT_URL | Direct URL for service endpoint | 
| DISABLE_ADMIN_UI | Toggle to disable the admin UI | 
| DISABLE_AIOHTTP_TRANSPORT | Flag to disable aiohttp transport. When this is set to True, litellm will use httpx instead of aiohttp. Default is False | 
| DISABLE_AIOHTTP_TRUST_ENV | Flag to disable aiohttp trust environment. When this is set to True, litellm will not trust the environment for aiohttp, e.g. the `HTTP_PROXY` and `HTTPS_PROXY` environment variables will not be used. Default is False | 
| DISABLE_SCHEMA_UPDATE | Toggle to disable schema updates | 
| DYNAMIC_RATE_LIMIT_ERROR_THRESHOLD_PER_MINUTE | Threshold for deployment failures per minute before enforcing rate limits in parallel request limiter. Default is 1 | 
| DOCS_DESCRIPTION | Description text for documentation pages | 
| DOCS_FILTERED | Flag indicating filtered documentation | 
| DOCS_TITLE | Title of the documentation pages | 
| DOCS_URL | The path to the Swagger API documentation. By default this is "/" | 
| EMAIL_LOGO_URL | URL for the logo used in emails | 
| EMAIL_SUPPORT_CONTACT | Support contact email address | 
| EMAIL_SIGNATURE | Custom HTML footer/signature for all emails. Can include HTML tags for formatting and links. | 
| EMAIL_SUBJECT_INVITATION | Custom subject template for invitation emails. | 
| EMAIL_SUBJECT_KEY_CREATED | Custom subject template for key creation emails. | 
| ENKRYPTAI_API_BASE | Base URL for EnkryptAI Guardrails API. Default is https://api.enkryptai.com | 
| ENKRYPTAI_API_KEY | API key for EnkryptAI Guardrails service | 
| EXPERIMENTAL_MULTI_INSTANCE_RATE_LIMITING | Flag to enable new multi-instance rate limiting. Default is False | 
| FIREWORKS_AI_4_B | Size parameter for Fireworks AI 4B model. Default is 4 | 
| FIREWORKS_AI_16_B | Size parameter for Fireworks AI 16B model. Default is 16 | 
| FIREWORKS_AI_56_B_MOE | Size parameter for Fireworks AI 56B MOE model. Default is 56 | 
| FIREWORKS_AI_80_B | Size parameter for Fireworks AI 80B model. Default is 80 | 
| FIREWORKS_AI_176_B_MOE | Size parameter for Fireworks AI 176B MOE model. Default is 176 | 
| FUNCTION_DEFINITION_TOKEN_COUNT | Token count for function definitions. Default is 9 | 
| GALILEO_BASE_URL | Base URL for Galileo platform | 
| GALILEO_PASSWORD | Password for Galileo authentication | 
| GALILEO_PROJECT_ID | Project ID for Galileo usage | 
| GALILEO_USERNAME | Username for Galileo authentication | 
| GOOGLE_SECRET_MANAGER_PROJECT_ID | Project ID for Google Secret Manager | 
| GCS_BUCKET_NAME | Name of the Google Cloud Storage bucket | 
| GCS_PATH_SERVICE_ACCOUNT | Path to the Google Cloud service account JSON file | 
| GCS_FLUSH_INTERVAL | Flush interval for GCS logging (in seconds). Specify how often you want a log to be sent to GCS. Default is 20 seconds | 
| GCS_BATCH_SIZE | Batch size for GCS logging. Specify after how many logs you want to flush to GCS. If `BATCH_SIZE` is set to 10, logs are flushed every 10 logs. Default is 2048 | 
| GCS_PUBSUB_TOPIC_ID | PubSub Topic ID to send LiteLLM SpendLogs to. | 
| GCS_PUBSUB_PROJECT_ID | PubSub Project ID to send LiteLLM SpendLogs to. | 
| GENERIC_AUTHORIZATION_ENDPOINT | Authorization endpoint for generic OAuth providers | 
| GENERIC_CLIENT_ID | Client ID for generic OAuth providers | 
| GENERIC_CLIENT_SECRET | Client secret for generic OAuth providers | 
| GENERIC_CLIENT_STATE | State parameter for generic client authentication | 
| GENERIC_CLIENT_USE_PKCE | Enable PKCE (Proof Key for Code Exchange) for generic OAuth providers. Set to "true" when your OAuth provider requires PKCE. Default is false | 
| GENERIC_SSO_HEADERS | Comma-separated list of additional headers to add to the request - e.g. Authorization=Bearer <token>, Content-Type=application/json, etc. | 
| GENERIC_INCLUDE_CLIENT_ID | Include client ID in requests for OAuth | 
| GENERIC_SCOPE | Scope settings for generic OAuth providers | 
| GENERIC_TOKEN_ENDPOINT | Token endpoint for generic OAuth providers | 
| GENERIC_USER_DISPLAY_NAME_ATTRIBUTE | Attribute for user's display name in generic auth | 
| GENERIC_USER_EMAIL_ATTRIBUTE | Attribute for user's email in generic auth | 
| GENERIC_USER_FIRST_NAME_ATTRIBUTE | Attribute for user's first name in generic auth | 
| GENERIC_USER_ID_ATTRIBUTE | Attribute for user ID in generic auth | 
| GENERIC_USER_LAST_NAME_ATTRIBUTE | Attribute for user's last name in generic auth | 
| GENERIC_USER_PROVIDER_ATTRIBUTE | Attribute specifying the user's provider | 
| GENERIC_USER_ROLE_ATTRIBUTE | Attribute specifying the user's role | 
| GENERIC_USERINFO_ENDPOINT | Endpoint to fetch user information in generic OAuth | 
| GEMINI_API_BASE | Base URL for Gemini API. Default is https://generativelanguage.googleapis.com | 
| GITHUB_COPILOT_TOKEN_DIR | Directory to store GitHub Copilot token for `github_copilot` LLM provider | 
| GITHUB_COPILOT_API_KEY_FILE | File to store GitHub Copilot API key for `github_copilot` LLM provider | 
| GITHUB_COPILOT_ACCESS_TOKEN_FILE | File to store GitHub Copilot access token for `github_copilot` LLM provider | 
| GREENSCALE_API_KEY | API key for Greenscale service | 
| GREENSCALE_ENDPOINT | Endpoint URL for Greenscale service | 
| GRAYSWAN_API_BASE | Base URL for GraySwan API. Default is https://api.grayswan.ai | 
| GRAYSWAN_API_KEY | API key for GraySwan Cygnal service | 
| GOOGLE_APPLICATION_CREDENTIALS | Path to Google Cloud credentials JSON file | 
| GOOGLE_CLIENT_ID | Client ID for Google OAuth | 
| GOOGLE_CLIENT_SECRET | Client secret for Google OAuth | 
| GOOGLE_KMS_RESOURCE_NAME | Name of the resource in Google KMS | 
| GUARDRAILS_AI_API_BASE | Base URL for Guardrails AI API | 
| HEALTH_CHECK_TIMEOUT_SECONDS | Timeout in seconds for health checks. Default is 60 | 
| HEROKU_API_BASE | Base URL for Heroku API | 
| HEROKU_API_KEY | API key for Heroku services | 
| HF_API_BASE | Base URL for Hugging Face API | 
| HCP_VAULT_ADDR | Address for Hashicorp Vault Secret Manager | 
| HCP_VAULT_CLIENT_CERT | Path to client certificate for Hashicorp Vault Secret Manager | 
| HCP_VAULT_CLIENT_KEY | Path to client key for Hashicorp Vault Secret Manager | 
| HCP_VAULT_NAMESPACE | Namespace for Hashicorp Vault Secret Manager | 
| HCP_VAULT_TOKEN | Token for Hashicorp Vault Secret Manager | 
| HCP_VAULT_CERT_ROLE | Role for Hashicorp Vault Secret Manager Auth | 
| HELICONE_API_KEY | API key for Helicone service | 
| HELICONE_API_BASE | Base URL for Helicone service, defaults to https://api.helicone.ai | 
| HOSTNAME | Hostname for the server. This will be emitted to `datadog` logs | 
| HOURS_IN_A_DAY | Hours in a day for calculation purposes. Default is 24 | 
| HUGGINGFACE_API_BASE | Base URL for Hugging Face API | 
| HUGGINGFACE_API_KEY | API key for Hugging Face API | 
| HUMANLOOP_PROMPT_CACHE_TTL_SECONDS | Time-to-live in seconds for cached prompts in Humanloop. Default is 60 | 
| IAM_TOKEN_DB_AUTH | IAM token for database authentication | 
| IBM_GUARDRAILS_API_BASE | Base URL for IBM Guardrails API | 
| IBM_GUARDRAILS_AUTH_TOKEN | Authorization bearer token for IBM Guardrails API | 
| INITIAL_RETRY_DELAY | Initial delay in seconds for retrying requests. Default is 0.5 | 
| JITTER | Jitter factor for retry delay calculations. Default is 0.75 | 
| JSON_LOGS | Enable JSON formatted logging | 
| JWT_AUDIENCE | Expected audience for JWT tokens | 
| JWT_PUBLIC_KEY_URL | URL to fetch public key for JWT verification | 
| LAGO_API_BASE | Base URL for Lago API | 
| LAGO_API_CHARGE_BY | Parameter to determine charge basis in Lago | 
| LAGO_API_EVENT_CODE | Event code for Lago API events | 
| LAGO_API_KEY | API key for accessing Lago services | 
| LANGFUSE_DEBUG | Toggle debug mode for Langfuse | 
| LANGFUSE_FLUSH_INTERVAL | Interval for flushing Langfuse logs | 
| LANGFUSE_TRACING_ENVIRONMENT | Environment for Langfuse tracing | 
| LANGFUSE_HOST | Host URL for Langfuse service | 
| LANGFUSE_PUBLIC_KEY | Public key for Langfuse authentication | 
| LANGFUSE_RELEASE | Release version of Langfuse integration | 
| LANGFUSE_SECRET_KEY | Secret key for Langfuse authentication | 
| LANGSMITH_API_KEY | API key for Langsmith platform | 
| LANGSMITH_BASE_URL | Base URL for Langsmith service | 
| LANGSMITH_BATCH_SIZE | Batch size for operations in Langsmith | 
| LANGSMITH_DEFAULT_RUN_NAME | Default name for Langsmith run | 
| LANGSMITH_PROJECT | Project name for Langsmith integration | 
| LANGSMITH_SAMPLING_RATE | Sampling rate for Langsmith logging | 
| LANGTRACE_API_KEY | API key for Langtrace service | 
| LASSO_API_BASE | Base URL for Lasso API | 
| LASSO_API_KEY | API key for Lasso service | 
| LASSO_USER_ID | User ID for Lasso service | 
| LASSO_CONVERSATION_ID | Conversation ID for Lasso service | 
| LENGTH_OF_LITELLM_GENERATED_KEY | Length of keys generated by LiteLLM. Default is 16 | 
| LEGACY_MULTI_INSTANCE_RATE_LIMITING | Flag to enable legacy multi-instance rate limiting. Default is False | 
| LITERAL_API_KEY | API key for Literal integration | 
| LITERAL_API_URL | API URL for Literal service | 
| LITERAL_BATCH_SIZE | Batch size for Literal operations | 
| LITELLM_ANTHROPIC_DISABLE_URL_SUFFIX | Disable automatic URL suffix appending for Anthropic API base URLs. When set to true, prevents LiteLLM from automatically adding `/v1/messages` or `/v1/complete` to custom Anthropic API endpoints | 
| LITELLM_DONT_SHOW_FEEDBACK_BOX | Flag to hide feedback box in LiteLLM UI | 
| LITELLM_DROP_PARAMS | Parameters to drop in LiteLLM requests | 
| LITELLM_MODIFY_PARAMS | Parameters to modify in LiteLLM requests | 
| LITELLM_EMAIL | Email associated with LiteLLM account | 
| LITELLM_GLOBAL_MAX_PARALLEL_REQUEST_RETRIES | Maximum retries for parallel requests in LiteLLM | 
| LITELLM_GLOBAL_MAX_PARALLEL_REQUEST_RETRY_TIMEOUT | Timeout for retries of parallel requests in LiteLLM | 
| LITELLM_MIGRATION_DIR | Custom migrations directory for prisma migrations, used for baselining db in read-only file systems. | 
| LITELLM_HOSTED_UI | URL of the hosted UI for LiteLLM | 
| LITELLM_KEY_ROTATION_ENABLED | Enable auto-key rotation for LiteLLM (boolean). Default is false. | 
| LITELLM_KEY_ROTATION_CHECK_INTERVAL_SECONDS | Interval in seconds for how often to run job that auto-rotates keys. Default is 86400 (24 hours). | 
| LITELLM_LICENSE | License key for LiteLLM usage | 
| LITELLM_LOCAL_MODEL_COST_MAP | Local configuration for model cost mapping in LiteLLM | 
| LITELLM_LOG | Enable detailed logging for LiteLLM | 
| LITELLM_LOG_FILE | File path to write LiteLLM logs to. When set, logs will be written to both console and the specified file | 
| LITELLM_LOGGER_NAME | Name for OTEL logger | 
| LITELLM_METER_NAME | Name for OTEL Meter | 
| LITELLM_OTEL_INTEGRATION_ENABLE_EVENTS | Optionally enable semantic logs for OTEL | 
| LITELLM_OTEL_INTEGRATION_ENABLE_METRICS | Optionally enable semantic metrics for OTEL | 
| LITELLM_MASTER_KEY | Master key for proxy authentication | 
| LITELLM_MODE | Operating mode for LiteLLM (e.g., production, development) | 
| LITELLM_RATE_LIMIT_WINDOW_SIZE | Rate limit window size for LiteLLM. Default is 60 | 
| LITELLM_SALT_KEY | Salt key for encryption in LiteLLM | 
| LITELLM_SSL_CIPHERS | SSL/TLS cipher configuration for faster handshakes. Controls cipher suite preferences for OpenSSL connections. | 
| LITELLM_SECRET_AWS_KMS_LITELLM_LICENSE | AWS KMS encrypted license for LiteLLM | 
| LITELLM_TOKEN | Access token for LiteLLM integration | 
| LITELLM_PRINT_STANDARD_LOGGING_PAYLOAD | If true, prints the standard logging payload to the console - useful for debugging | 
| LITELM_ENVIRONMENT | Environment for LiteLLM Instance. This is currently only logged to DeepEval to determine the environment for DeepEval integration. | 
| LOGFIRE_TOKEN | Token for Logfire logging service | 
| MAX_EXCEPTION_MESSAGE_LENGTH | Maximum length for exception messages. Default is 2000 | 
| MAX_STRING_LENGTH_PROMPT_IN_DB | Maximum length for strings in spend logs when sanitizing request bodies. Strings longer than this will be truncated. Default is 1000 | 
| MAX_IN_MEMORY_QUEUE_FLUSH_COUNT | Maximum count for in-memory queue flush operations. Default is 1000 | 
| MAX_LONG_SIDE_FOR_IMAGE_HIGH_RES | Maximum length for the long side of high-resolution images. Default is 2000 | 
| MAX_REDIS_BUFFER_DEQUEUE_COUNT | Maximum count for Redis buffer dequeue operations. Default is 100 | 
| MAX_SHORT_SIDE_FOR_IMAGE_HIGH_RES | Maximum length for the short side of high-resolution images. Default is 768 | 
| MAX_SIZE_IN_MEMORY_QUEUE | Maximum size for in-memory queue. Default is 10000 | 
| MAX_SIZE_PER_ITEM_IN_MEMORY_CACHE_IN_KB | Maximum size in KB for each item in memory cache. Default is 512 or 1024 | 
| MAX_SPENDLOG_ROWS_TO_QUERY | Maximum number of spend log rows to query. Default is 1,000,000 | 
| MAX_TEAM_LIST_LIMIT | Maximum number of teams to list. Default is 20 | 
| MAX_TILE_HEIGHT | Maximum height for image tiles. Default is 512 | 
| MAX_TILE_WIDTH | Maximum width for image tiles. Default is 512 | 
| MAX_TOKEN_TRIMMING_ATTEMPTS | Maximum number of attempts to trim a token message. Default is 10 | 
| MAXIMUM_TRACEBACK_LINES_TO_LOG | Maximum number of lines to log in traceback in LiteLLM Logs UI. Default is 100 | 
| MAX_RETRY_DELAY | Maximum delay in seconds for retrying requests. Default is 8.0 | 
| MAX_LANGFUSE_INITIALIZED_CLIENTS | Maximum number of Langfuse clients to initialize on proxy. Default is 20. This limit exists because Langfuse starts one thread every time a client is initialized; we've had an incident in the past where CPU utilization reached 100% because Langfuse was initialized several times. | 
| MIN_NON_ZERO_TEMPERATURE | Minimum non-zero temperature value. Default is 0.0001 | 
| MINIMUM_PROMPT_CACHE_TOKEN_COUNT | Minimum token count for caching a prompt. Default is 1024 | 
| MISTRAL_API_BASE | Base URL for Mistral API. Default is https://api.mistral.ai | 
| MISTRAL_API_KEY | API key for Mistral API | 
| MICROSOFT_CLIENT_ID | Client ID for Microsoft services | 
| MICROSOFT_CLIENT_SECRET | Client secret for Microsoft services | 
| MICROSOFT_TENANT | Tenant ID for Microsoft Azure | 
| MICROSOFT_SERVICE_PRINCIPAL_ID | Service Principal ID for Microsoft Enterprise Application. (This is an advanced feature if you want litellm to auto-assign members to Litellm Teams based on their Microsoft Entra ID Groups) | 
| NO_DOCS | Flag to disable Swagger UI documentation | 
| NO_REDOC | Flag to disable Redoc documentation | 
| NO_PROXY | List of addresses to bypass proxy | 
| NON_LLM_CONNECTION_TIMEOUT | Timeout in seconds for non-LLM service connections. Default is 15 | 
| OAUTH_TOKEN_INFO_ENDPOINT | Endpoint for OAuth token info retrieval | 
| OPENAI_BASE_URL | Base URL for OpenAI API | 
| OPENAI_API_BASE | Base URL for OpenAI API. Default is https://api.openai.com/ | 
| OPENAI_API_KEY | API key for OpenAI services | 
| OPENAI_FILE_SEARCH_COST_PER_1K_CALLS | Cost per 1000 calls for OpenAI file search. Default is 0.0025 | 
| OPENAI_ORGANIZATION | Organization identifier for OpenAI | 
| OPENID_BASE_URL | Base URL for OpenID Connect services | 
| OPENID_CLIENT_ID | Client ID for OpenID Connect authentication | 
| OPENID_CLIENT_SECRET | Client secret for OpenID Connect authentication | 
| OPENMETER_API_ENDPOINT | API endpoint for OpenMeter integration | 
| OPENMETER_API_KEY | API key for OpenMeter services | 
| OPENMETER_EVENT_TYPE | Type of events sent to OpenMeter | 
| OTEL_ENDPOINT | OpenTelemetry endpoint for traces | 
| OTEL_EXPORTER_OTLP_ENDPOINT | OpenTelemetry endpoint for traces | 
| OTEL_ENVIRONMENT_NAME | Environment name for OpenTelemetry | 
| OTEL_EXPORTER | Exporter type for OpenTelemetry | 
| OTEL_EXPORTER_OTLP_PROTOCOL | Exporter type for OpenTelemetry | 
| OTEL_HEADERS | Headers for OpenTelemetry requests | 
| OTEL_MODEL_ID | Model ID for OpenTelemetry tracing | 
| OTEL_EXPORTER_OTLP_HEADERS | Headers for OpenTelemetry requests | 
| OTEL_SERVICE_NAME | Service name identifier for OpenTelemetry | 
| OTEL_TRACER_NAME | Tracer name for OpenTelemetry tracing | 
| PAGERDUTY_API_KEY | API key for PagerDuty Alerting | 
| PANW_PRISMA_AIRS_API_KEY | API key for PANW Prisma AIRS service | 
| PANW_PRISMA_AIRS_API_BASE | Base URL for PANW Prisma AIRS service | 
| PHOENIX_API_KEY | API key for Arize Phoenix | 
| PHOENIX_COLLECTOR_ENDPOINT | API endpoint for Arize Phoenix | 
| PHOENIX_COLLECTOR_HTTP_ENDPOINT | API http endpoint for Arize Phoenix | 
| PILLAR_API_BASE | Base URL for Pillar API Guardrails | 
| PILLAR_API_KEY | API key for Pillar API Guardrails | 
| PILLAR_ON_FLAGGED_ACTION | Action to take when content is flagged ('block' or 'monitor') | 
| POD_NAME | Pod name for the server. This will be emitted to `datadog` logs as `POD_NAME` | 
| POSTHOG_API_KEY | API key for PostHog analytics integration | 
| POSTHOG_API_URL | Base URL for PostHog API (defaults to https://us.i.posthog.com) | 
| PREDIBASE_API_BASE | Base URL for Predibase API | 
| PRESIDIO_ANALYZER_API_BASE | Base URL for Presidio Analyzer service | 
| PRESIDIO_ANONYMIZER_API_BASE | Base URL for Presidio Anonymizer service | 
| PROMETHEUS_BUDGET_METRICS_REFRESH_INTERVAL_MINUTES | Refresh interval in minutes for Prometheus budget metrics. Default is 5 | 
| PROMETHEUS_FALLBACK_STATS_SEND_TIME_HOURS | Fallback time in hours for sending stats to Prometheus. Default is 9 | 
| PROMETHEUS_URL | URL for Prometheus service | 
| PROMPTLAYER_API_KEY | API key for PromptLayer integration | 
| PROXY_ADMIN_ID | Admin identifier for proxy server | 
| PROXY_BASE_URL | Base URL for proxy service | 
| PROXY_BATCH_WRITE_AT | Time in seconds to wait before batch writing spend logs to the database. Default is 30 | 
| PROXY_BATCH_POLLING_INTERVAL | Time in seconds to wait before polling a batch, to check if it's completed. Default is 3600 (1 hour) | 
| PROXY_BUDGET_RESCHEDULER_MAX_TIME | Maximum time in seconds to wait before checking database for budget resets. Default is 605 | 
| PROXY_BUDGET_RESCHEDULER_MIN_TIME | Minimum time in seconds to wait before checking database for budget resets. Default is 597 | 
| PYTHON_GC_THRESHOLD | GC thresholds ('gen0,gen1,gen2', e.g. '1000,50,50'); defaults to Python’s values. | 
| PROXY_LOGOUT_URL | URL for logging out of the proxy service | 
| QDRANT_API_BASE | Base URL for Qdrant API | 
| QDRANT_API_KEY | API key for Qdrant service | 
| QDRANT_SCALAR_QUANTILE | Scalar quantile for Qdrant operations. Default is 0.99 | 
| QDRANT_URL | Connection URL for Qdrant database | 
| QDRANT_VECTOR_SIZE | Vector size for Qdrant operations. Default is 1536 | 
| REDIS_CONNECTION_POOL_TIMEOUT | Timeout in seconds for Redis connection pool. Default is 5 | 
| REDIS_HOST | Hostname for Redis server | 
| REDIS_PASSWORD | Password for Redis service | 
| REDIS_PORT | Port number for Redis server | 
| REDIS_SOCKET_TIMEOUT | Timeout in seconds for Redis socket operations. Default is 0.1 | 
| REDIS_GCP_SERVICE_ACCOUNT | GCP service account for IAM authentication with Redis. Format: "projects/-/serviceAccounts/name@project.iam.gserviceaccount.com" | 
| REDIS_GCP_SSL_CA_CERTS | Path to SSL CA certificate file for secure GCP Memorystore Redis connections | 
| REDOC_URL | The path to the Redoc FastAPI documentation. By default this is "/redoc" | 
| REPEATED_STREAMING_CHUNK_LIMIT | Limit for repeated streaming chunks to detect looping. Default is 100 | 
| REALTIME_WEBSOCKET_MAX_MESSAGE_SIZE_BYTES | Maximum size in bytes for WebSocket messages in realtime connections. Default is None. | 
| REPLICATE_MODEL_NAME_WITH_ID_LENGTH | Length of Replicate model names with ID. Default is 64 | 
| REPLICATE_POLLING_DELAY_SECONDS | Delay in seconds for Replicate polling operations. Default is 0.5 | 
| REQUEST_TIMEOUT | Timeout in seconds for requests. Default is 6000 | 
| ROUTER_MAX_FALLBACKS | Maximum number of fallbacks for router. Default is 5 | 
| SECRET_MANAGER_REFRESH_INTERVAL | Refresh interval in seconds for secret manager. Default is 86400 (24 hours) | 
| SEPARATE_HEALTH_APP | If set to '1', runs health endpoints on a separate ASGI app and port. Default: '0'. | 
| SEPARATE_HEALTH_PORT | Port for the separate health endpoints app. Only used if SEPARATE_HEALTH_APP=1. Default: 4001. | 
| SERVER_ROOT_PATH | Root path for the server application | 
| SET_VERBOSE | Flag to enable verbose logging | 
| SINGLE_DEPLOYMENT_TRAFFIC_FAILURE_THRESHOLD | Minimum number of requests to consider "reasonable traffic" for single-deployment cooldown logic. Default is 1000 | 
| SLACK_DAILY_REPORT_FREQUENCY | Frequency of daily Slack reports (e.g., daily, weekly) | 
| SLACK_WEBHOOK_URL | Webhook URL for Slack integration | 
| SMTP_HOST | Hostname for the SMTP server | 
| SMTP_PASSWORD | Password for SMTP authentication (do not set if SMTP does not require auth) | 
| SMTP_PORT | Port number for SMTP server | 
| SMTP_SENDER_EMAIL | Email address used as the sender in SMTP transactions | 
| SMTP_SENDER_LOGO | Logo used in emails sent via SMTP | 
| SMTP_TLS | Flag to enable or disable TLS for SMTP connections | 
| SMTP_USERNAME | Username for SMTP authentication (do not set if SMTP does not require auth) | 
| SPEND_LOGS_URL | URL for retrieving spend logs | 
| SPEND_LOG_CLEANUP_BATCH_SIZE | Number of logs deleted per batch during cleanup. Default is 1000 | 
| SSL_CERTIFICATE | Path to the SSL certificate file | 
| SSL_ECDH_CURVE | ECDH curve for SSL/TLS key exchange (e.g., 'X25519' to disable PQC). | 
| SSL_SECURITY_LEVEL | [BETA] Security level for SSL/TLS connections. E.g. DEFAULT@SECLEVEL=1 | 
| SSL_VERIFY | Flag to enable or disable SSL certificate verification | 
| SSL_CERT_FILE | Path to the SSL certificate file for custom CA bundle | 
| SUPABASE_KEY | API key for Supabase service | 
| SUPABASE_URL | Base URL for Supabase instance | 
| STORE_MODEL_IN_DB | If true, enables storing model + credential information in the DB. | 
| SYSTEM_MESSAGE_TOKEN_COUNT | Token count for system messages. Default is 4 | 
| TEST_EMAIL_ADDRESS | Email address used for testing purposes | 
| TOGETHER_AI_4_B | Size parameter for Together AI 4B model. Default is 4 | 
| TOGETHER_AI_8_B | Size parameter for Together AI 8B model. Default is 8 | 
| TOGETHER_AI_21_B | Size parameter for Together AI 21B model. Default is 21 | 
| TOGETHER_AI_41_B | Size parameter for Together AI 41B model. Default is 41 | 
| TOGETHER_AI_80_B | Size parameter for Together AI 80B model. Default is 80 | 
| TOGETHER_AI_110_B | Size parameter for Together AI 110B model. Default is 110 | 
| TOGETHER_AI_EMBEDDING_150_M | Size parameter for Together AI 150M embedding model. Default is 150 | 
| TOGETHER_AI_EMBEDDING_350_M | Size parameter for Together AI 350M embedding model. Default is 350 | 
| TOOL_CHOICE_OBJECT_TOKEN_COUNT | Token count for tool choice objects. Default is 4 | 
| UI_LOGO_PATH | Path to the logo image used in the UI | 
| UI_PASSWORD | Password for accessing the UI | 
| UI_USERNAME | Username for accessing the UI | 
| UPSTREAM_LANGFUSE_DEBUG | Flag to enable debugging for upstream Langfuse | 
| UPSTREAM_LANGFUSE_HOST | Host URL for upstream Langfuse service | 
| UPSTREAM_LANGFUSE_PUBLIC_KEY | Public key for upstream Langfuse authentication | 
| UPSTREAM_LANGFUSE_RELEASE | Release version identifier for upstream Langfuse | 
| UPSTREAM_LANGFUSE_SECRET_KEY | Secret key for upstream Langfuse authentication | 
| USE_AWS_KMS | Flag to enable AWS Key Management Service for encryption | 
| USE_PRISMA_MIGRATE | Flag to use prisma migrate instead of prisma db push. Recommended for production environments. | 
| WEBHOOK_URL | URL for receiving webhooks from external services | 
| SPEND_LOG_RUN_LOOPS | Number of loops the spend_log_cleanup task should run, each loop deleting a batch of 1000 logs | 
| COROUTINE_CHECKER_MAX_SIZE_IN_MEMORY | Maximum size for CoroutineChecker in-memory cache. Default is 1000 | 
| DEFAULT_SHARED_HEALTH_CHECK_TTL | Time-to-live in seconds for cached health check results in shared health check mode. Default is 300 (5 minutes) | 
| DEFAULT_SHARED_HEALTH_CHECK_LOCK_TTL | Time-to-live in seconds for health check lock in shared health check mode. Default is 60 (1 minute) |
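
Several rows above describe a value format and a default, for example `PYTHON_GC_THRESHOLD` ('gen0,gen1,gen2') and `REDIS_SOCKET_TIMEOUT` (default 0.1). The sketch below shows one way such values could be read and validated in Python. It is an illustration only, not the proxy's actual parsing code; the helper names `read_float` and `parse_gc_threshold` are hypothetical and were chosen for this example.

```python
import gc
import os
from typing import Mapping, Optional, Tuple


def read_float(env: Mapping[str, str], name: str, default: float) -> float:
    """Return env[name] as a float, or the default when unset or empty.

    Hypothetical helper for illustration; not part of any library.
    """
    raw = env.get(name, "").strip()
    return float(raw) if raw else default


def parse_gc_threshold(raw: Optional[str]) -> Optional[Tuple[int, int, int]]:
    """Parse a 'gen0,gen1,gen2' string; return None when unset or empty.

    Hypothetical helper for illustration; not part of any library.
    """
    if raw is None or not raw.strip():
        return None
    parts = [int(p) for p in raw.split(",")]
    if len(parts) != 3:
        raise ValueError(f"expected 'gen0,gen1,gen2', got {raw!r}")
    return parts[0], parts[1], parts[2]


if __name__ == "__main__":
    # Falls back to the documented default of 0.1 seconds when unset.
    redis_timeout = read_float(os.environ, "REDIS_SOCKET_TIMEOUT", 0.1)

    # Only override Python's GC thresholds when the variable is set.
    thresholds = parse_gc_threshold(os.environ.get("PYTHON_GC_THRESHOLD"))
    if thresholds is not None:
        gc.set_threshold(*thresholds)

    print(redis_timeout, gc.get_threshold())
```

Returning `None` for an unset `PYTHON_GC_THRESHOLD` keeps Python's own thresholds in place, which matches the "defaults to Python's values" wording in the table.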