
FoundationVision

Hi there 👋

This is the official FoundationVision website repo.

Popular repositories

  1. VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

    Python 4.3k 321

  2. LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1.4k 56

  3. GLEE Public

    [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1.1k 86

  4. Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python 572 61

  5. OmniTokenizer Public

    [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

    Python 265 7

  6. UniRef Public

    [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

    Python 235 15

Repositories

Showing 9 of 9 repositories
  • GLEE Public

    [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1,087 MIT 86 38 2 Updated Oct 21, 2024
  • VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

    Python 4,321 MIT 321 32 0 Updated Oct 6, 2024
  • LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1,354 MIT 56 50 0 Updated Aug 15, 2024
  • OmniTokenizer Public

    [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

    Python 265 MIT 7 8 0 Updated Jul 9, 2024
  • vaex Public

    🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

    Python 43 MIT 2 1 0 Updated Jun 23, 2024
  • Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python 572 Apache-2.0 61 8 1 Updated Jun 7, 2024
  • GenerateU Public

    [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

    Python 146 6 13 0 Updated Mar 25, 2024
  • UniRef Public

    [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

    Python 235 MIT 15 4 0 Updated Jan 10, 2024
  • .github Public
    0 0 0 0 Updated Dec 16, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Python