Sarthak Mittal
  • Projects
  • Internships
  • Research
  • TAships
  • About Me
Navigation bar avatar

Audio-Visual Instance Discrimination

CS753 (Automatic Speech Recognition), IIT Bombay

Posted on May 1, 2023

Audio-Visual Instance Discrimination

CS753 (Automatic Speech Recognition), IIT Bombay

Posted on May 1, 2023

Code

Description

We evaluated the AVID-CMA model on HMDB-51 dataset by tuning the training parameters to improve the accuracy slightly. We implemented a combination of Cross-Entropy Loss and MSE-Loss in order to balance the loss function for this particular dataset.

Contributors

Lakshya Gupta
Parshant Arora

Image credits

  • Imgur and BeFunky
  • https://arxiv.org/pdf/2004.12943.pdf
  • Email me
  • GitHub
  • LinkedIn
  • ORCID

Sarthak Mittal  •  2024  •  sarthakmittal92.github.io

Powered by Beautiful Jekyll