Intelligence & Robotics

Search Log In

Intelligence & Robotics

Figure10

From: Retrieve-then-compare mitigates visual hallucination in multi-modal large language models

Retrieve-then-compare mitigates visual hallucination in multi-modal large language models

Figure 10. Qualitative results on our proposed image captioning benchmark. Our method RCD improves the accuracy of image descriptions and reduces various types of visual hallucinations. Additionally, RCD enhances the specificity of the descriptions. Incorrect content is highlighted in red, while correct content is highlighted in green. RCD: Retrieval contrastive decoding.

Intelligence & Robotics

ISSN 2770-3541 (Online)

[email protected]

Navigation

Follow Us

Navigation

Committee on Publication Ethics

https://members.publicationethics.org/members/intelligence-robotics

Portico

All published articles are preserved here permanently:

https://www.portico.org/publishers/oae/

Committee on Publication Ethics

https://members.publicationethics.org/members/intelligence-robotics

Portico

All published articles are preserved here permanently:

https://www.portico.org/publishers/oae/

[email protected]

Discover Content

Language Editing

Layout & Production

Graphical Abstracts

Video Abstracts

Conference Organizer

Strategic Collaborators

Follow OAE

© 2016-2025 OAE Publishing Inc., except certain content provided by third parties

Privacy Cookies Terms of Service