Supported Model Formats

Melange supports two model formats for on-device deployment. Choose the format that best matches your training framework.

Supported Formats

Format	Extension	Status	Recommended
PyTorch Exported Program	`.pt2`	Supported	Yes
ONNX	`.onnx`	Supported	Yes

Choosing a Format

PyTorch users: Use PyTorch Exported Program (.pt2). Requires PyTorch >= 2.9.
TensorFlow / Keras / scikit-learn users: Convert to ONNX format.

Input Requirements

Regardless of format, all models require:

NumPy input files (.npy): Sample inputs that define the expected tensor shapes and data types.
Fixed input shapes: NPU compilation hard-codes input shapes for maximum throughput. Even if your original model supports dynamic sizes, the Melange-compiled model will accept only the exact shape provided during upload.
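
As a rough illustration of the two requirements above, the following sketch writes one sample input to a .npy file with NumPy. The shape, the dtype, and the file name `input_0.npy` are assumptions chosen for the example, not values Melange prescribes; substitute whatever your model actually expects.

```python
import tempfile
from pathlib import Path

import numpy as np

# Hypothetical image-model input: batch 1, 3 channels, 224x224, float32.
INPUT_SHAPE = (1, 3, 224, 224)

# Scratch directory for this demo; any writable location works.
out_dir = Path(tempfile.mkdtemp())
sample_path = out_dir / "input_0.npy"  # illustrative file name

# Random values are enough here: the file's purpose is to record the
# tensor shape and data type.
rng = np.random.default_rng(seed=0)
sample = rng.random(INPUT_SHAPE, dtype=np.float32)
np.save(sample_path, sample)

# Reload to confirm what was stored before uploading.
loaded = np.load(sample_path)
print(loaded.shape, loaded.dtype)
```

Because the compiled model accepts only this exact shape, it is worth checking the reloaded shape and dtype against the model's input signature before uploading.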

Graph Constraints

Melange compiles your model into a static computation graph, which places constraints on the graph itself beyond just input shapes. Address these before exporting.

Fixed Shapes Throughout the Graph

Every tensor shape inside the graph — not only the inputs — must resolve to a constant at export time. If any intermediate operation produces a shape that depends on runtime values, compilation will fail.

Common sources of dynamic internal shapes:

Data-dependent control flow (if / while conditioned on tensor values)
Operations like nonzero, masked_select, or slicing with indices that aren't compile-time constants
Variable-length sequences without padding

Refactor these into fixed-shape equivalents (for example, pad to a maximum length and apply a mask instead of gathering a dynamic subset) so that all shapes become constants during export.
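
The pad-and-mask refactor can be sketched as follows. This is a minimal illustration in NumPy, assuming a hypothetical maximum length of 8 and a simple mean reduction; the same element-wise pattern (multiply by a mask, then reduce) would be written with the equivalent tensor operations inside a PyTorch model.

```python
import numpy as np

MAX_LEN = 8  # hypothetical maximum length fixed at export time


def pad_with_mask(values, max_len=MAX_LEN):
    """Pad a 1-D sequence to max_len and return (padded, mask)."""
    values = np.asarray(values, dtype=np.float32)
    if values.ndim != 1 or values.shape[0] > max_len:
        raise ValueError(f"expected a 1-D sequence of at most {max_len} items")
    padded = np.zeros(max_len, dtype=np.float32)
    mask = np.zeros(max_len, dtype=np.float32)
    padded[: values.shape[0]] = values
    mask[: values.shape[0]] = 1.0
    return padded, mask


def masked_mean(padded, mask):
    """Mean over valid positions using only fixed-shape operations."""
    # Dynamic version (avoid): padded[mask.astype(bool)].mean()
    # The boolean index yields an array whose length depends on the data.
    total = (padded * mask).sum()
    count = np.maximum(mask.sum(), 1.0)  # guard against an all-padding input
    return total / count


padded, mask = pad_with_mask([1.0, 2.0, 3.0])
print(padded.shape, mask.shape, masked_mean(padded, mask))
```

Every intermediate array here has shape `(MAX_LEN,)` or is a scalar, regardless of how many elements are valid, which is the property a static graph needs.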

Verifying Input Order and Shapes

Inspecting with Netron

Open your model (.pt2 or .onnx) in Netron.
Locate the input nodes at the top of the graph.
Note the vertical order: the top-most input is index 0, the next is index 1, and so on.

You must provide inputs in this exact order when:

Uploading the model and sample inputs via the Melange Dashboard.
Calling the run() function in your Android/iOS app.
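
One way to keep the order consistent in both places is to record the input names once, in the order read from Netron, and derive everything else from that list. The sketch below assumes two hypothetical input names and an illustrative `input_<index>.npy` naming scheme; neither is required by Melange, and the signature of `run()` is not shown because it is not described here.

```python
import tempfile
from pathlib import Path

import numpy as np

# Hypothetical input names, listed top to bottom as they appear in Netron.
NETRON_ORDER = ["input_ids", "attention_mask"]


def order_inputs(named_inputs, order=NETRON_ORDER):
    """Return the arrays as a list sorted by graph input index."""
    missing = [name for name in order if name not in named_inputs]
    extra = [name for name in named_inputs if name not in order]
    if missing or extra:
        raise KeyError(f"missing={missing}, unexpected={extra}")
    return [named_inputs[name] for name in order]


# The dict is deliberately built in the "wrong" order.
named = {
    "attention_mask": np.ones((1, 16), dtype=np.int64),
    "input_ids": np.zeros((1, 16), dtype=np.int64),
}
ordered = order_inputs(named)

# Save one sample file per input, numbered by graph index.
out_dir = Path(tempfile.mkdtemp())
for index, array in enumerate(ordered):
    np.save(out_dir / f"input_{index}.npy", array)  # illustrative names
```

Keeping a single ordered list as the source of truth means the sample files and the arguments supplied in the app cannot drift apart unnoticed.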

Figure: Checking input sequence with Netron

Next Steps

PyTorch Export: Export PyTorch models to .pt2
ONNX Models: Convert TensorFlow, Keras, and scikit-learn models
Pre-built Models: Use ready-to-run models from Dashboard or Hugging Face
