Basics

mean

  • Keeping the reduced dimension: torch uses keepdim, mlx uses keepdims.

  • Choosing the dimension to reduce over: torch uses dim, mlx uses axis.

cat

torch uses cat, while mlx uses concatenate.

torch's cat takes dim, while mlx's concatenate takes axis.

Sequential

torch uses this_layer[0], while mlx uses this_layer.layers[0].