in General Model
Choose the optimal model mode in general model
ZETIC.MLange provides several custom modes for model inference to balance between speed and accuracy based on your needs.
Available Modes
Default (Auto)
Intelligently balances speed and accuracy for optimal performance. This mode automatically selects the fastest configuration while ensuring high-quality results (SNR > 20dB), making it ideal for most use cases.
Speed First
Maximizes inference speed with minimum latency. Recommended for real-time applications where response time is the top priority.
Accuracy First
Delivers the highest precision based on maximum SNR scores. Best suited for applications where accuracy is more critical than speed.
The optimal mode is automatically determined based on:
- Speed metrics: Inference time (latency in ms)
- Accuracy metrics: SNR (Signal-to-Noise Ratio in dB)
You can override this automatic selection by explicitly specifying a mode.
API Usage
// Default: Model Load (same as ModelMode.RUN_AUTO)
// Speed first, but maintains SNR above 20.f
private val model_default = ZeticMLangeModel(
CONTEXT,
$PERSONAL_KEY,
$MODEL_NAME,
$VERSION,
ModelMode.RUN_AUTO
)
// Speed First Mode
private val model_fast = ZeticMLangeModel(
CONTEXT,
$PERSONAL_KEY,
$MODEL_NAME,
$VERSION,
ModelMode.RUN_SPEED
)
// Accuracy First Mode
private val model_accurate = ZeticMLangeModel(
CONTEXT,
$PERSONAL_KEY,
$MODEL_NAME,
$VERSION,
ModelMode.RUN_ACCURACY
)// Default: Model Load (same as ModelMode.RUN_AUTO)
// Speed first, but maintains SNR above 20.f
let model_default = try ZeticMLangeModel(
$PERSONAL_KEY,
$MODEL_NAME,
$VERSION,
ModelMode.RUN_AUTO
)
// Speed First Mode
let model_fast = try ZeticMLangeModel(
$PERSONAL_KEY,
$MODEL_NAME,
$VERSION,
ModelMode.RUN_SPEED
)
// Accuracy First Mode
let model_accurate = try ZeticMLangeModel(
$PERSONAL_KEY,
$MODEL_NAME,
$VERSION,
ModelMode.RUN_ACCURACY
)Contact Us
Collaborations or update requests are always welcome! Please contact us at contact@zetic.ai.