Large Multimodal Model (LMM)

A Large Multimodal Model (LMM) is a deep learning model that can process and generate different modalities of data, such as text, images, audio, and video.



References

2023