finetuning on a classification task #35

Sravanthgithub · 2024-01-22T11:01:27Z

Hey, I have a dataset of images and videos that I want to align with text. My use case is plain binary classification, so the texts are just two sentences: 'The data is live' and 'The data is non live'. Basically, I want to improve my model's performance by using a multimodal model. How do I do this? Are there any resources?
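
A minimal sketch of the setup described above, to make the question concrete. It assumes some multimodal encoder (not shown, and not this repository's actual API) has already turned each image or video and each of the two sentences into fixed-size embedding vectors. The vectors below are hypothetical placeholders, and the `temperature` value is an assumed CLIP-style default, not something taken from this project. Classification then reduces to picking the sentence whose embedding is most similar to the media embedding.

```python
import math

# The two sentences from the post, used as class prompts.
PROMPTS = ["The data is live", "The data is non live"]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def classify(media_embedding, prompt_embeddings, temperature=0.07):
    """Return (predicted prompt, probabilities) from a softmax over similarities.

    `temperature` is an assumed scaling factor; smaller values sharpen the softmax.
    """
    logits = [cosine(media_embedding, p) / temperature for p in prompt_embeddings]
    # Subtract the max logit for numerical stability before exponentiating.
    top = max(logits)
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    best = max(range(len(probs)), key=probs.__getitem__)
    return PROMPTS[best], probs


# Hypothetical placeholder vectors standing in for real encoder outputs.
prompt_embeddings = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
media_embedding = [0.9, 0.1, 0.2]

label, probs = classify(media_embedding, prompt_embeddings)
print(label, probs)
```

Under these assumptions, fine-tuning would mean updating the encoder (or a small head on top of it) so that a cross-entropy loss on `probs` against the true live / non live label goes down. Whether that beats a plain binary classifier on the media embeddings alone is exactly what I am unsure about.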
