Retrieve-then-compare mitigates visual hallucination in multi-modal large language models

Figure 9. Qualitative results on our proposed VQA benchmark. Our method improves the accuracy of answers from three MLLMs across nearly all question categories. Human-annotated answers are provided for each test sample. Incorrect and correct answers are highlighted in red and green, respectively. VQA: Visual Question Answering.

Intelligence & Robotics
ISSN 2770-3541 (Online)

Portico

All published articles are preserved here permanently:

https://www.portico.org/publishers/oae/