MLX
Apple's machine learning framework built for Apple silicon.
Short, clear definitions for the words that make docs feel heavier than they need to be.
Apple's machine learning framework built for Apple silicon.
Apple's M-series chips, like M1, M2, M3, and newer.
A grid of numbers in memory. In ML work, arrays hold inputs, activations, and weights.
A general ML word for a multi-dimensional array. Same basic idea, just broader terminology.
Automatic differentiation: the system that figures out gradients for training.
A signal showing how to change model weights to reduce error.
Work gets delayed until the result is actually needed, which can help performance.
The computation graph forms as your code runs, which makes experimentation easier.
CPU and GPU share memory on Apple silicon, reducing expensive copying.
Running a trained model to get an output.
Training an existing model a bit more so it gets better at a specific job.
A lightweight fine-tuning method that updates a small adapter instead of the whole model.
Shrinking model precision so it uses less memory and often runs faster.
The tool that breaks text into pieces a model can process.
The learned numbers inside a model. Loading weights means loading what the model has learned.
A saved model state you can reload later.
The graphics processor, often used to speed up ML computation.
A group of examples processed together for efficiency.
How much text a language model can consider at once.
The instructions or input text you feed into a model.
A vision-language model: a model that can reason across images and text.
A model format and interchange layer used to move models across frameworks or runtimes.
A way to use the same core library from another language like Swift or C.
Running the model on your own machine instead of calling a cloud API.