GPU Inference Pulse (SLM Edition)
SYSTEM:
L4 24GB (Standard)
A100 40GB (High)
H100 80GB (Mega)
MODEL:
Gemma 3 1B
Gemma 3 4B
Gemma 3 12B
Gemma 3 27B
RESET GRID
SYSTEM IDLE.
INITIATE PROMPT:
SHORT (3)
MEDIUM (5)
LONG (15)
SIMULATE LOAD:
WORKDAY
BATCH RUN
BURST HELL
LEGEND:
Model Weights Memory Footprint
Short Prompt Input Tokens
Medium Prompt Input Tokens
Long Prompt Input Tokens
Processing Input Tokens
Generated Tokens (Output)
Free Memory
fredmo