We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation .
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Python 1.5k 99
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Python 2.5k 241
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Python 2.9k 264
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
Python 579 37
Video-P2P: Video Editing with Cross-attention Control
Python 332 22
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'
Python 255 17
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs