Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Python, 2.9k stars, 264 forks
Official implementation of "LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models"
Python, 579 stars, 37 forks
[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
32 stars, 1 fork