Improve CoMPaSS-FLUX.1 Model Card: Add pipeline_tag, library_name, and format links
This PR enhances the model card for CoMPaSS-FLUX.1 by:
- **Adding `pipeline_tag: text-to-image` and `library_name: diffusers`** to the metadata. This categorizes the model correctly on the Hugging Face Hub and enables the automated "Use in Diffusers" widget; the model card already states "Framework: Diffusers" in its description.
- **Improving the visibility of key links** (Project Page, Code, arXiv paper) by moving them to prominent inline links at the top of the model card.
- **Integrating the paper title** "CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models" into the main heading and the model description for better context.
- **Removing a redundant heading** and updating the "Using the Model" link for overall clarity and consistency.
These changes will make the model more discoverable and easier to use for the community.
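As a rough illustration of the metadata change, the sketch below checks whether a model card's YAML front matter declares both keys added by this PR. It is a simplified line-based reader written for this example, not a full YAML parser and not part of the Hub tooling; the names `front_matter_keys`, `missing_keys`, `REQUIRED_KEYS`, and `CARD` are hypothetical, and the sample card is abbreviated from the front matter in this PR.

```python
import re

# Keys this PR adds to the model card metadata.
REQUIRED_KEYS = ("pipeline_tag", "library_name")

# Abbreviated copy of the updated front matter (widget entries omitted).
CARD = """---
base_model: black-forest-labs/FLUX.1-dev
license: other
license_name: compass-lora-weights-nc-license
license_link: LICENSE
pipeline_tag: text-to-image
library_name: diffusers
tags:
- text-to-image
- lora
---

# CoMPaSS-FLUX.1: Enhancing Spatial Understanding in Text-to-Image Diffusion Models
"""


def front_matter_keys(card_text):
    """Return top-level `key: value` pairs from the leading `---` block.

    Assumption: a simplified reader that only handles flat scalar entries;
    list items and nested values are skipped rather than parsed.
    """
    match = re.match(r"\A---\s*\n(.*?)\n---\s*(?:\n|\Z)", card_text, flags=re.DOTALL)
    if match is None:
        return {}
    pairs = {}
    for line in match.group(1).splitlines():
        entry = re.match(r"^([A-Za-z_][\w-]*):\s*(.*)$", line)
        if entry:
            pairs[entry.group(1)] = entry.group(2).strip()
    return pairs


def missing_keys(card_text):
    """List the required metadata keys that are absent or empty."""
    found = front_matter_keys(card_text)
    return [key for key in REQUIRED_KEYS if not found.get(key)]


if __name__ == "__main__":
    print(missing_keys(CARD))
```

Running the same check against the previous version of the card, which lacked both keys, would list `pipeline_tag` and `library_name` as missing.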
    @@ -1,4 +1,10 @@
     ---
    +base_model: black-forest-labs/FLUX.1-dev
    +license: other
    +license_name: compass-lora-weights-nc-license
    +license_link: LICENSE
    +pipeline_tag: text-to-image
    +library_name: diffusers
     tags:
     - text-to-image
     - lora
    @@ -14,26 +20,18 @@ widget:
     - text: a photo of a horse to the left of a bottle
       output:
         url: images/horse-left-bottle.jpg
    -base_model: black-forest-labs/FLUX.1-dev
    -instance_prompt: null
    -license: other
    -license_name: compass-lora-weights-nc-license
    -license_link: LICENSE
     ---
    -# CoMPaSS-FLUX.1
     
    -
    -\[[code]\]
    -\[[arXiv]\]
    +# CoMPaSS-FLUX.1: Enhancing Spatial Understanding in Text-to-Image Diffusion Models
     
    -
    +[Project Page](https://compass.blurgy.xyz) | [Code](https://github.com/blurgyy/CoMPaSS) | [arXiv](https://arxiv.org/abs/2412.13195)
     
    -
    +<Gallery />
     
    -
    +## Model description
     
     A LoRA adapter that enhances spatial understanding capabilities of the FLUX.1 text-to-image
    -diffusion model. This model demonstrates significant improvements in generating images with specific
    +diffusion model. This model, presented in [CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models](https://arxiv.org/abs/2412.13195), demonstrates significant improvements in generating images with specific
     spatial relationships between objects.
     
     ## Model Details
    @@ -62,7 +60,7 @@ spatial relationships between objects.
     
     ## Using the Model
     
    -See our [GitHub repository]
    +See our [GitHub repository](https://github.com/blurgyy/CoMPaSS) to get started.
     
     ### Effective Prompting
     
    @@ -124,7 +122,4 @@ Weights for this model are available in Safetensors format.
     
     [Download](/blurgy/CoMPaSS-FLUX.1/tree/main) them in the Files & versions tab.
     
    -[./LICENSE]: <./LICENSE>
    -[Project page]: <https://compass.blurgy.xyz>
    -[code]: <https://github.com/blurgyy/CoMPaSS>
    -[arXiv]: <https://arxiv.org/abs/2412.13195>
    +[./LICENSE]: <./LICENSE>