Figure8

From: Retrieve-then-compare mitigates visual hallucination in multi-modal large language models

Figure 8. RCD effectively reduces visual hallucinations in detailed image descriptions. DoLa's response is omitted when it is identical to the greedy baseline. Correct and hallucinatory contents are highlighted in green and red, respectively. RCD: Retrieval contrastive decoding.

Intelligence & Robotics

ISSN 2770-3541 (Online)

editorial@intellrobot.com

Navigation

Sitemap

Navigation

Sitemap

Committee on Publication Ethics

https://members.publicationethics.org/members/intelligence-robotics

Portico

All published articles are preserved here permanently:

https://www.portico.org/publishers/oae/

Committee on Publication Ethics

https://members.publicationethics.org/members/intelligence-robotics

Portico

All published articles are preserved here permanently:

https://www.portico.org/publishers/oae/

partners@oaepublish.com

Discover Content

Strategic Collaborators

Follow OAE

Privacy Cookies Terms of Service

Your privacy, your choice

We use essential cookies to make sure the site can function. We also use optional cookies for advertising, personalisation of content, usage analysis, and social media.
By accepting optional cookies, you consent to the processing of your personal data - including transfers to third parties. Some third parties are outside of the European Economic Area, with varying standards of data protection. See our privacy policy for more information on the use of your personal data.