Neelabh Sinha
  • About
  • Publications
  • Experience
  • Projects
  • Skills
  • Certifications
  • Publications
    • Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types
    • Are Small Language Models Ready to Compete with Large Language Models for Practical Applications?
    • Multimodal Personality Recognition using Cross-attention Transformer and Behaviour Encoding
    • FLAME: Facial Landmark Heatmap Activated Multimodal Gaze Estimation
  • Projects
  • Projects
    • PEFT Strikes Back - Exploring Efficient Finetuning of Language Models
    • A Multi-Stage Vision-Language Framework for Knowledge-based Visual Question Answering
      • Predicting FIFA World Cup Outcomes
      • Drowsiness Detection in Drivers
      • Design of Model Predictive RBFN Controller for Non-linear Plants
      • Private Chat Application using MongoDB and Socket.io
      • Leakage Detection in Smart Water-Distribution Systems
      • News-text Classification using a Weighted RNN
    • Experience

    On this page

      A Multi-Stage Vision-Language Framework for Knowledge-based Visual Question Answering

      Dec 5, 2023 · 1 min read
      Go to Project Site

      Add more content to display. Leave blank to directly redirect to the page.

      Last updated on Dec 5, 2023
      Natural Language Processing Computer Vision Machine Learning Deep Learning Question Answering
      Neelabh Sinha
      Authors
      Neelabh Sinha
      Graduate Student

      ← PEFT Strikes Back - Exploring Efficient Finetuning of Language Models Apr 30, 2024
      Predicting FIFA World Cup Outcomes Nov 30, 2023 →

      © Neelabh Sinha. All rights reserved.

      Published with Hugo Blox Builder — the free, open source website builder that empowers creators.