mit einem Klick
tracking-usage
// Track usage with output metadata in inference.sh apps. Use when implementing billing, counting tokens, or reporting image/video/audio generation metrics.
// Track usage with output metadata in inference.sh apps. Use when implementing billing, counting tokens, or reporting image/video/audio generation metrics.
| name | tracking-usage |
| description | Track usage with output metadata in inference.sh apps. Use when implementing billing, counting tokens, or reporting image/video/audio generation metrics. |
Enable usage-based pricing by reporting what your app processes.
| Type | Class | Fields |
|---|---|---|
| Text | TextMeta | tokens |
| Image | ImageMeta | width, height, steps, count |
| Video | VideoMeta | width, height, seconds |
| Audio | AudioMeta | seconds |
from inferencesh.models.usage import OutputMeta, TextMeta, ImageMeta, VideoMeta, AudioMeta
# LLM tokens
output_meta=OutputMeta(
inputs=[TextMeta(tokens=prompt_tokens)],
outputs=[TextMeta(tokens=completion_tokens)]
)
# Image generation
output_meta=OutputMeta(
outputs=[ImageMeta(width=1024, height=1024, steps=20, count=1)]
)
# Video generation
output_meta=OutputMeta(
outputs=[VideoMeta(width=1280, height=720, seconds=5.0)]
)
# Audio generation
output_meta=OutputMeta(
outputs=[AudioMeta(seconds=30.0)]
)
output_meta=OutputMeta(
outputs=[ImageMeta(
width=1024, height=1024,
extra={"model": "sdxl-turbo", "lora_count": 2}
)]
)
📖 Full docs: inference.sh/docs/extend/output-meta
Configure inf.yml for inference.sh apps. Use when setting GPU, VRAM, RAM, categories, environment variables, packages.txt, or resource requirements.
Debug and troubleshoot inference.sh apps. Use when facing import errors, CUDA issues, memory problems, or deployment failures.
Handle graceful cancellation in inference.sh apps. Use when implementing long-running tasks that users might cancel.
Handle API keys and sensitive values in inference.sh apps. Use when adding secrets, accessing environment variables, or securing credentials.
Optimize inference.sh app performance. Use when handling memory, devices, model loading, mixed precision, or flash attention.
Build and deploy applications on inference.sh. Use when getting started, understanding the platform, or needing an overview of inference.sh development.