nvidia
/

PhysicalAI-Robotics-mindmap-Checkpoints

Model card Files Files and versions

xet

Community

update model card

by remostei - opened Oct 1

base: refs/heads/main

←

from: refs/pr/4

Discussion Files changed

+38

-19

Files changed (1) hide show

README.md +38 -19

README.md CHANGED Viewed

@@ -11,11 +11,12 @@ datasets:
 ### Description:
-``mindmap`` is a 3D diffusion policy that generates robot trajectories based on a semantic 3D reconstruction of the environment,
-enabling robots with spatial memory.
 Trained models are available on Hugging Face: [PhysicalAI-Robotics-mindmap-Checkpoints](https://huggingface.co/nvidia/PhysicalAI-Robotics-mindmap-Checkpoints)
 ### License/Terms of Use
 - Model: [NVIDIA Open Model License Agreement](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/)
@@ -33,6 +34,11 @@ The trained ``mindmap`` policies allow for quick evaluation of the ``mindmap`` c
 - Developers: Integrate and customize AI for various robotic applications.
 - Startups & Companies: Accelerate robotics development and reduce training costs.
 ## References(s):
@@ -90,6 +96,9 @@ The trained ``mindmap`` policies allow for quick evaluation of the ``mindmap`` c
 - Gripper: `[PREDICTION_HORIZON, NUM_GRIPPERS, 8]` - consisting of end-effector translation, rotation (quaternion, wxyz) and closedness
 - Head Yaw: `[PREDICTION_HORIZON, 1]` - only for humanoid embodiments
 ## Software Integration:
 **Runtime Engine(s):** PyTorch
@@ -108,6 +117,10 @@ The trained ``mindmap`` policies allow for quick evaluation of the ``mindmap`` c
 **Preferred/Supported Operating System(s):**
 * Linux
 ## Model Version(s):
 This is the initial version of the model, version 1.0.0
@@ -120,7 +133,24 @@ Datasets:
 - drill_in_box_checkpoint: [GR1 Drill in Box Dataset](https://huggingface.co/datasets/nvidia/PhysicalAI-Robotics-mindmap-GR1-Drill-in-Box)
 - stick_in_bin_checkpoint: [GR1 Stick in Bin Dataset](https://huggingface.co/datasets/nvidia/PhysicalAI-Robotics-mindmap-GR1-Stick-in-Bin)
-The models were trained on 100 (GR1) and 130 (Franka) demonstrations. The evaluation set consisted of 20 distinct demonstrations. Closed loop testing was performed on 100 demonstrations mutually exclusive from the training set.
 # Inference:
@@ -132,9 +162,9 @@ The models were trained on 100 (GR1) and 130 (Franka) demonstrations. The evalua
 This model is not tested or intended for use in mission critical applications that require functional safety. The use of the model in those applications is at the user's own risk and sole responsibility, including taking the necessary steps to add needed guardrails or safety mechanisms.
-- Risk: This policy is only effective on the exact simulation environment it was trained on.
-    - Mitigation: Need to retrain the model on new simulation environments.
-- Risk: The policy was never tested on a physical robot and likely only works in simulation
     - Mitigation: Expand training, testing and validation on physical robot platforms.
 ## Ethical Considerations:
@@ -143,19 +173,14 @@ NVIDIA believes Trustworthy AI is a shared responsibility and we have establishe
 For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards.
-Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).
 # Bias
 Field                                                                                               |  Response
 :---------------------------------------------------------------------------------------------------|:---------------
 Participation considerations from adversely impacted groups [protected classes](https://www.senate.ca.gov/content/protected-classes) in model design and testing:  |  Not Applicable
 Bias Metric (If Measured):                                                   |  Not Applicable
-(For GPAI Models) Which characteristic (feature) show(s) the greatest difference in performance?: |  Not Applicable
-(For GPAI Models): Which feature(s) have have the worst performance overall? | Not Applicable
 Measures taken to mitigate against unwanted bias:                                                   |  Not Applicable
-(For GPAI Models): If using internal data, description of methods implemented in data acquisition or processing, if any, to address the prevalence of identifiable biases in the training, testing, and validation data: | Not Applicable
-(For GPAI Models): Tools used to assess statistical imbalances and highlight patterns that may introduce bias into AI models: | Not Applicable
-(For GPAI Models): Tools used to assess statistical imbalances and highlight patterns that may introduce bias into AI models: | Not Applicable
 # Explainability
 Field                                                                                                  |  Response
@@ -164,10 +189,9 @@ Intended Task/Domain:
 Model Type:                                                                                            |  Denoising Diffusion Probabilistic Model
 Intended Users:                                                                                        |  Roboticists and researchers in academia and industry who are interested in robot manipulation research
 Output:                                                                                                |  Actions consisting of end-effector poses, gripper states and head orientation.
-(For GPAI Models): Tools used to evaluate datasets to identify synthetic data and ensure data authenticity. | Not Applicable
 Describe how the model works:                                                                          |  ``mindmap`` is a Denoising Diffusion Probabilistic Model that samples robot trajectories conditioned on sensor observations and a 3D reconstruction of the environment.
 Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of:  |  Not Applicable
-Technical Limitations & Mitigation:                                                                    |  - The policy is only effective on the exact simulation environment it was trained on. - The policy was never tested on a physical robot and likely only works in simulation.
 Verified to have met prescribed NVIDIA quality standards:  |  Yes
 Performance Metrics:                                                                                   |  Closed loop success rate on simulated robotic manipulation tasks.
 Potential Known Risks:                                                                                 |  The model might be susceptible to rendering changes on the simulation tasks it was trained on.
@@ -177,9 +201,6 @@ Licensing:
 Field                                               |  Response
 :---------------------------------------------------|:----------------------------------
 Model Application Field(s):                               |  Robotics
-Describe the life critical impact (if present).   |  Not Applicable
-(For GPAI Models): Description of methods implemented in data acquisition or processing, if any, to address other types of potentially harmful data in the training, testing, and validation data: | Not GPAI
-(For GPAI Models): Description of any methods implemented in data acquisition or processing, if any, to address illegal or harmful content in the training data, including, but not limited to, child sexual abuse material (CSAM) and non-consensual intimate imagery (NCII) | Not GPAI
 Use Case Restrictions:                              |  Abide by [NVIDIA Open Model License Agreement](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/)
 Model and dataset restrictions:            |  The Principle of least privilege (PoLP) is applied limiting access for dataset generation and model development.  Restrictions enforce dataset access during training, and dataset license constraints adhered to.
@@ -188,8 +209,6 @@ Field
 :----------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------
 Generatable or reverse engineerable personal data?                                                     |  No
 Personal data used to create this model?                                                                                       |  No
-Was consent obtained for any personal data used?                                                                                             |  Not Applicable
-(For GPAI Models): A description of any methods implemented in data acquisition or processing, if any, to address the prevalence of personal data in the training data, where relevant and applicable. | Not Applicable
 How often is dataset reviewed?                                                                                                     |  Before Release
 Is there provenance for all datasets used in training?                                                                                |  Yes
 Does data labeling (annotation, metadata) comply with privacy laws?                                                                |  Yes

 ### Description:
+``mindmap`` is a 3D diffusion policy that generates robot trajectories based on a semantic 3D reconstruction of the environment, enabling robots with spatial memory.
 Trained models are available on Hugging Face: [PhysicalAI-Robotics-mindmap-Checkpoints](https://huggingface.co/nvidia/PhysicalAI-Robotics-mindmap-Checkpoints)
+This model is ready for commercial/non-commercial use
 ### License/Terms of Use
 - Model: [NVIDIA Open Model License Agreement](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/)
 - Developers: Integrate and customize AI for various robotic applications.
 - Startups & Companies: Accelerate robotics development and reduce training costs.
+### Release Date
+Github 09/26/2025 via github.com/NVlabs/nvblox_mindmap
+Hugging Face 09/26/2025 via huggingface.co/nvidia/PhysicalAI-Robotics-mindmap-Checkpoints
 ## References(s):
 - Gripper: `[PREDICTION_HORIZON, NUM_GRIPPERS, 8]` - consisting of end-effector translation, rotation (quaternion, wxyz) and closedness
 - Head Yaw: `[PREDICTION_HORIZON, 1]` - only for humanoid embodiments
+Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated systems.
+By leveraging NVIDIA’s hardware (e.g. GPU cores) and software frameworks (e.g., CUDA libraries),
+the model achieves faster training and inference times compared to CPU-only solutions.
 ## Software Integration:
 **Runtime Engine(s):** PyTorch
 **Preferred/Supported Operating System(s):**
 * Linux
+The integration of foundation and fine-tuned models into AI systems requires additional testing using use-case-specific data to ensure safe and effective deployment.
+Following the V-model methodology, iterative testing and validation at both unit and system levels are essential to mitigate risks,
+meet technical and functional requirements, and ensure compliance with safety and ethical standards before deployment.
 ## Model Version(s):
 This is the initial version of the model, version 1.0.0
 - drill_in_box_checkpoint: [GR1 Drill in Box Dataset](https://huggingface.co/datasets/nvidia/PhysicalAI-Robotics-mindmap-GR1-Drill-in-Box)
 - stick_in_bin_checkpoint: [GR1 Stick in Bin Dataset](https://huggingface.co/datasets/nvidia/PhysicalAI-Robotics-mindmap-GR1-Stick-in-Bin)
+**Data Modality:** Image, 3D reconstruction, robot states
+**Image Training Data Size:** Less than a Million Images
+**3D reconstruction, robot state Data Size:** Less than a Million Samples
+**Data Collection Method by dataset:**
+* Synthetic
+* Human teleoperation
+* Automatic trajectory generation
+**Properties:**
+The models were trained on 100 (GR1) and 130 (Franka) demonstrations.
+The evaluation set consisted of 20 distinct demonstrations.
+Closed loop testing was performed on 100 demonstrations mutually exclusive from the training set.
+The training data is synthetic only and fully generated in Isaac Lab.
 # Inference:
 This model is not tested or intended for use in mission critical applications that require functional safety. The use of the model in those applications is at the user's own risk and sole responsibility, including taking the necessary steps to add needed guardrails or safety mechanisms.
+- Limitation: This policy is only effective in the exact simulation environment in which it was trained.
+    - Mitigation: Recommended to retrain the model in new simulation environments.
+- Limitation: The policy was not tested on a physical robot and likely only works in simulation.
     - Mitigation: Expand training, testing and validation on physical robot platforms.
 ## Ethical Considerations:
 For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards.
+Please report model quality, risk, security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).
 # Bias
 Field                                                                                               |  Response
 :---------------------------------------------------------------------------------------------------|:---------------
 Participation considerations from adversely impacted groups [protected classes](https://www.senate.ca.gov/content/protected-classes) in model design and testing:  |  Not Applicable
 Bias Metric (If Measured):                                                   |  Not Applicable
 Measures taken to mitigate against unwanted bias:                                                   |  Not Applicable
 # Explainability
 Field                                                                                                  |  Response
 Model Type:                                                                                            |  Denoising Diffusion Probabilistic Model
 Intended Users:                                                                                        |  Roboticists and researchers in academia and industry who are interested in robot manipulation research
 Output:                                                                                                |  Actions consisting of end-effector poses, gripper states and head orientation.
 Describe how the model works:                                                                          |  ``mindmap`` is a Denoising Diffusion Probabilistic Model that samples robot trajectories conditioned on sensor observations and a 3D reconstruction of the environment.
 Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of:  |  Not Applicable
+Technical Limitations & Mitigation:                                                                    |  - Limitation: This policy is only effective in the exact simulation environment in which it was trained. Mitigation: Recommended to retrain the model in new simulation environments. - Limitation: The policy was not tested on a physical robot and likely only works in simulation. Mitigation: Expand training, testing and validation on physical robot platforms.
 Verified to have met prescribed NVIDIA quality standards:  |  Yes
 Performance Metrics:                                                                                   |  Closed loop success rate on simulated robotic manipulation tasks.
 Potential Known Risks:                                                                                 |  The model might be susceptible to rendering changes on the simulation tasks it was trained on.
 Field                                               |  Response
 :---------------------------------------------------|:----------------------------------
 Model Application Field(s):                               |  Robotics
 Use Case Restrictions:                              |  Abide by [NVIDIA Open Model License Agreement](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/)
 Model and dataset restrictions:            |  The Principle of least privilege (PoLP) is applied limiting access for dataset generation and model development.  Restrictions enforce dataset access during training, and dataset license constraints adhered to.
 :----------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------
 Generatable or reverse engineerable personal data?                                                     |  No
 Personal data used to create this model?                                                                                       |  No
 How often is dataset reviewed?                                                                                                     |  Before Release
 Is there provenance for all datasets used in training?                                                                                |  Yes
 Does data labeling (annotation, metadata) comply with privacy laws?                                                                |  Yes