arxiv.org
The large LLMs [48] now boosts up to 1000 billion parameters, while the widely-used vision encoders of VLLMs are still around one billion. This gap may lead to the under-use of LLM’s capacity.
Key phrase: align with the parameter scale of the LLM.