VLA = Vision-language-action model: https://en.wikipedia.org/wiki/Vision-language-action_model
Not https://public.nrao.edu/telescopes/VLA/ :(
For completeness, MMLLM = Multimodal large language model.