World of Software
Copyright © All Rights Reserved. World of Software.

New Dataset Challenges AI to Explain the Humor and Sarcasm It ‘Sees’ and ‘Reads’ | HackerNoon

News Room
Last updated: 2025/06/18 at 5:41 PM
News Room Published 18 June 2025

Authors:

(1) Arkadiy Saakyan, Columbia University;

(2) Shreyas Kulkarni, Columbia University;

(3) Tuhin Chakrabarty, Columbia University;

(4) Smaranda Muresan, Columbia University.

Editor’s note: this is part 2 of 6 of a study looking at how well large AI models handle figurative language. Read the rest below.


Textual entailment (MacCartney and Manning, 2008; Bowman et al., 2015) and visual entailment (Xie et al., 2019) tasks have been proposed to measure language and multimodal understanding. However, models trained simply to improve label accuracy on these data can be brittle and suffer from spurious correlations (Poliak et al., 2018; Gururangan et al., 2018; McCoy et al., 2019; Gardner et al., 2021). Datasets such as e-SNLI (Camburu et al., 2018) and e-SNLI-VE (Kayser et al., 2021) augment existing entailment datasets with natural language explanations and train models not only to predict the label, but also to generate a textual explanation of the reasoning behind the prediction. This approach has been adopted for a variety of tasks, such as commonsense reasoning (Rajani et al., 2019; Aggarwal et al., 2021) and social norm understanding (CH-Wang et al., 2023), among others (Wiegreffe and Marasovic, 2021). It has also been extended to assess LLMs’ capabilities in understanding figurative language through the FLUTE dataset (Chakrabarty et al., 2022), which frames figurative language understanding as an explainable textual entailment task. Recent progress in multimodal models (Li et al., 2022; Alayrac et al., 2022; OpenAI, 2023; Team et al., 2023; Liu et al., 2023b; Anthropic, 2024) prompts us to assess similar capabilities in a multimodal setting, testing the understanding of nonliteral meaning contained in both images and text. We present an equivalent of the FLUTE dataset for the visual modality: V-FLUTE.
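To make the "explainable entailment" framing concrete, here is a minimal sketch of what a single instance might look like as a data structure. The class name, field names, file paths, and the two toy records are our own illustrative assumptions; they are not taken from the V-FLUTE release or from the paper.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, Literal

# Only the two labels named in the Table 1 caption (E and C) are modeled here.
Label = Literal["entailment", "contradiction"]


@dataclass(frozen=True)
class ExplainableEntailmentInstance:
    """One hypothetical instance of an explainable visual entailment task."""

    image_path: str   # premise: the image (hypothetical field name)
    claim: str        # hypothesis: the text to check against the image
    label: Label      # gold label
    explanation: str  # free-text justification for the label


def count_labels(instances: Iterable[ExplainableEntailmentInstance]) -> Counter:
    """Count instances per label, similar in spirit to the E/C columns of a composition table."""
    return Counter(instance.label for instance in instances)


# Invented toy records, for illustration only.
toy_instances = [
    ExplainableEntailmentInstance(
        image_path="images/toy_001.jpg",
        claim="The office was a zoo that morning.",
        label="entailment",
        explanation="The image shows a chaotic, crowded office, matching the metaphor.",
    ),
    ExplainableEntailmentInstance(
        image_path="images/toy_002.jpg",
        claim="The office was a zoo that morning.",
        label="contradiction",
        explanation="The image shows a calm, empty office, which conflicts with the metaphor.",
    ),
]

print(count_labels(toy_instances))
```

The point of the sketch is that the explanation sits alongside the label as a required field, so a model is evaluated on both what it predicts and why.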

A number of previous works have focused on modeling figurative phenomena beyond text. Chakrabarty et al. (2023) use a human-AI collaboration framework to generate visual metaphors from linguistic metaphors (HAIVMet dataset) and propose a visual entailment task as an extrinsic evaluation of dataset quality. The dataset contains images, claims, and labels, but no textual explanations. Yosef et al. (2023) proposed a benchmark (IRFL) where, given an idiom, metaphor, or simile, the model has to distinguish which of the four associated images implies the figurative meaning of the expression. This dataset focuses on the figurative meaning in the textual modality and does not contain textual explanations. There has also been work on understanding multimodal sarcasm with explanations (Desai et al., 2022), mostly containing noisy user-generated text and crowdworker-written explanations. Another line of work has focused on understanding humor with multimodal models. MemeCap (Hwang and Shwartz, 2023) is a dataset for understanding memes. Hessel et al. (2023) release a corpus of annotated New Yorker Caption Contest entries, where the goal is to come up with a humorous caption for an image, with high-quality explanations for why the caption is humorous. The dataset is relatively limited in size, containing only 520 unique instances in its training set. We leverage all these benchmarks to build V-FLUTE.

Table 1: V-FLUTE dataset composition: 5 figurative phenomena, source datasets, and our contributions. E denotes the number of entailment instances, C the number of contradiction instances.
