OpenGVLab
/

InternVideo2_Chat_8B_InternLM2_5

Video-Text-to-Text

Model card Files Files and versions

ynhe commited on Sep 19, 2024

Commit

095df6f

·

verified ·

1 Parent(s): de18473

[fix] fix image ilen

Files changed (1) hide show

modeling_videochat2.py +3 -1

modeling_videochat2.py CHANGED Viewed

@@ -257,7 +257,7 @@ class InternVideo2_VideoChat2(BaseMLLM):
       return_history =False,
       generation_config={}
     ):
-        ilen = media_tensor.shape[1]
         conversation = ""
         if instruction:
@@ -268,8 +268,10 @@ class InternVideo2_VideoChat2(BaseMLLM):
                 )
         if media_type == 'image':
             conversation +=( "<img>" + IMG_TOKEN + "</img>")*ilen
         else:
             conversation += ("<vid>" + VID_TOKEN + "</vid>")*ilen

       return_history =False,
       generation_config={}
     ):
         conversation = ""
         if instruction:
                 )
         if media_type == 'image':
+            ilen = media_tensor.shape[0]
             conversation +=( "<img>" + IMG_TOKEN + "</img>")*ilen
         else:
+            ilen = media_tensor.shape[1]
             conversation += ("<vid>" + VID_TOKEN + "</vid>")*ilen