
9.7 VLM

VLM

CLIP (Contrastive Language-Image Pre-training) is a multimodal machine learning model released by OpenAI. Trained with contrastive learning on large-scale image-text pairs, it encodes both images and text into a shared vector space, so that an image and a caption describing it end up close together. This example demonstrates using CLIP for image management and text-based image search on the RDK platform.
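To make the shared vector space idea concrete, here is a minimal sketch of text-to-image search by cosine similarity. It does not run CLIP or use any RDK API: the three-dimensional vectors, the file names, and the helper names (`l2_normalize`, `cosine_similarity`, `search`) are all hypothetical placeholders standing in for the embeddings that real image and text encoders would produce.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def search(text_embedding, image_index, top_k=2):
    """Return the top_k (name, score) pairs, most similar image first."""
    scored = [
        (name, cosine_similarity(text_embedding, emb))
        for name, emb in image_index.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Assumption: made-up 3-D vectors in place of real image-encoder outputs.
IMAGE_INDEX = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "dog.jpg": [0.1, 0.9, 0.0],
    "car.jpg": [0.0, 0.1, 0.9],
}

# Assumption: made-up vector in place of the text-encoder output
# for a query such as "a photo of a cat".
QUERY = [0.8, 0.2, 0.1]

results = search(QUERY, IMAGE_INDEX)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

In a real deployment the index would hold one embedding per stored image, computed once when the image is added, and only the text query would be encoded at search time.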


Copyright © 2025 utmsys.