Update README.md
Browse files
    	
        README.md
    CHANGED
    
    | @@ -1,3 +1,52 @@ | |
| 1 | 
            -
            ---
         | 
| 2 | 
            -
            license: cc-by-nc-sa-4.0
         | 
| 3 | 
            -
            ---
         | 
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            ---
         | 
| 2 | 
            +
            license: cc-by-nc-sa-4.0
         | 
| 3 | 
            +
            ---
         | 
| 4 | 
            +
             | 
| 5 | 
            +
            # 4DGT Model Card
         | 
| 6 | 
            +
             | 
| 7 | 
            +
            ## Model Details
         | 
| 8 | 
            +
             | 
| 9 | 
            +
            4DGT (4D Gaussian Transformer) is a neural network model that learns dynamic 3D Gaussian representations from monocular videos. It uses a transformer-based architecture to predict 4D Gaussians from a dynamic scenes observed from an egocentric video.
         | 
| 10 | 
            +
             | 
| 11 | 
            +
            - **Paper:** [4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos](https://arxiv.org/abs/2506.08015)
         | 
| 12 | 
            +
            - **Project Page:** [https://4dgt.github.io/](https://4dgt.github.io/)
         | 
| 13 | 
            +
            - **Github:** [GitHub repository](https://github.com/facebookresearch/4dgt)
         | 
| 14 | 
            +
             | 
| 15 | 
            +
            Please refer to the project page and github for more details of the model. 
         | 
| 16 | 
            +
             | 
| 17 | 
            +
            ## Citation
         | 
| 18 | 
            +
             | 
| 19 | 
            +
            ```bibtex
         | 
| 20 | 
            +
            @inproceedings{xu20254dgt,
         | 
| 21 | 
            +
                title     = {4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos},
         | 
| 22 | 
            +
                author    = {Xu, Zhen and Li, Zhengqin and Dong, Zhao and Zhou, Xiaowei and Newcombe, Richard and Lv, Zhaoyang},
         | 
| 23 | 
            +
                journal   = {arXiv preprint arXiv:2506.08015},
         | 
| 24 | 
            +
                year      = {2025}
         | 
| 25 | 
            +
            }
         | 
| 26 | 
            +
            ```
         | 
| 27 | 
            +
             | 
| 28 | 
            +
            ## Model Files
         | 
| 29 | 
            +
             | 
| 30 | 
            +
            ### Checkpoint: `4dgt_full.pth`
         | 
| 31 | 
            +
            - **Size:** ~14.5 GB
         | 
| 32 | 
            +
            - **Format:** PyTorch state dict
         | 
| 33 | 
            +
            - **Contents:**
         | 
| 34 | 
            +
              - The full model trained as described in the paper.
         | 
| 35 | 
            +
              - Encoder weights (DINOv2 backbone)
         | 
| 36 | 
            +
              - Level of Details Transformer 
         | 
| 37 | 
            +
              - 4D Gaussian Decoder 
         | 
| 38 | 
            +
             | 
| 39 | 
            +
            ### Checkpoint: `4dgt_1st_stage.pth`
         | 
| 40 | 
            +
            - **Size:** ~4.85 GB
         | 
| 41 | 
            +
            - **Format:** PyTorch state dict
         | 
| 42 | 
            +
            - **Contents:**
         | 
| 43 | 
            +
              - The first stage model trained only using Egoexo4D dataset as described in the paper. 
         | 
| 44 | 
            +
              - Encoder weights (DINOv2 backbone)
         | 
| 45 | 
            +
              - Vanilla Transformer, no level of details. 
         | 
| 46 | 
            +
              - 4D Gaussian Decoder
         | 
| 47 | 
            +
             | 
| 48 | 
            +
            ## Quick Start
         | 
| 49 | 
            +
            Please refer to [4DGT GitHub repository](https://github.com/facebookresearch/4dgt) for the full set up. 
         | 
| 50 | 
            +
             | 
| 51 | 
            +
            ## Contact
         | 
| 52 | 
            +
            For questions and issues, please open an issue on the [GitHub repository](https://github.com/facebookresearch/4dgt).
         | 
