Image Data Encoding Task
Jump to navigation
Jump to search
An Image Data Encoding Task is an data encoding task for image data.
- Context:
- It can (typically) involve converting raw image data into high-dimensional feature embeddings.
- It can (typically) be performed by an Image Encodinct System (that implements an image encoding algorithm).
- It can range from encoding small image patches to entire gigapixel images.
- It can produce outputs that serve as inputs for downstream tasks like image retrieval or image generation.
- It can handle both 2D images and 3D volumetric data in fields like medical imaging.
- ...
- Example(s):
- a CNN-based image encoder used for image classification in a real-time image recognition system.
- a Vision Transformer model that processes gigapixel pathology slides to extract meaningful slide-level embeddings.
- ...
- Counter-Example(s):
- Text Data Encoding, which involve converting text into text embeddings.
- Audio Data Encoding, which focus on encoding audio signals into audio feature vectors.
- Image Embedding Decodeing, which ...
- See: Text Data Encoding, Audio Data Encoding, Feature Extraction, Convolutional Neural Networks, Vision Transformers.
References
2024
- (Xu, Usuyama et al., 2024) ⇒ Hanwen Xu, Naoto Usuyama, Jaspreet Bagga, Sheng Zhang, Rajesh Rao, Tristan Naumann, Cliff Wong, Zelalem Gero, Javier González, Yu Gu, Yanbo Xu, Mu Wei, Wenhui Wang, Shuming Ma, Furu Wei, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Jaylen Rosemon, Tucker Bower, Soohee Lee, Roshanthi Weerasinghe, Bill J. Wright, Ari Robicsek, Brian Piening, Carlo Bifulco, Sheng Wang, and Hoifung Poon. (2024). “A Whole-slide Foundation Model for Digital Pathology from Real-world Data.” In: Nature. doi:10.1038/s41586-024-07441-w