Fermi Problems: A New Reasoning Challenge for AI

AI2

Last update: May 28, 2022

Related tags

Deep Learning fermi

Overview

Fermi Problems: A New Reasoning Challenge for AI

Fermi Problems are questions whose answer is a number that can only be reasonably estimated as a precise measurement of the value is either impossible or impractical.

This repository provides two datasets of such fermi problems along with annotations for the solution:

RealFP @ ./data/realFP. A collection of 928 fermi problems and their solutions expressed in the form a program.
SynthFP @ .data/synthFP. An auxilliary set of 10000 templated fermi questions, created by the authors.

Code for compiling the program in the dataset and computing the accuracy metric is provided in eval_utils.py. For more details on the datasets, please refer to our paper: How Much Coffee Was Consumed During EMNLP 2019? Fermi Problems: A New Reasoning Challenge for AI.

Inference

You can download a model finetuned on the realFP dataset here. Answers to your fermi questions can be obtained by executing the following command: python inference --question your_question_here. Make sure to check requirements.txt for any dependencies.

If you use the datasets or any other content shared in this repository, please cite our work:

@article{kalyan2021much,
  title={How Much Coffee Was Consumed During EMNLP 2019? Fermi Problems: A New Reasoning Challenge for AI},
  author={Kalyan, Ashwin and Kumar, Abhinav and Chandrasekaran, Arjun and Sabharwal, Ashish and Clark, Peter},
  journal={arXiv preprint arXiv:2110.14207},
  year={2021}
}

You might also like...

Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"

Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning This is the Github repository of our paper, "Common S

19 Nov 30, 2022

Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"

Introduction Code and data for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning". We cons

81 Dec 27, 2022

Database Reasoning Over Text project for ACL paper

Database Reasoning over Text This repository contains the code for the Database Reasoning Over Text paper, to appear at ACL2021. Work is performed in

320 Dec 12, 2022

Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)

Pytorch implementation of Relational Networks - A simple neural network module for relational reasoning Implemented & tested on Sort-of-CLEVR task. So

800 Dec 5, 2022

The CLRS Algorithmic Reasoning Benchmark

Learning representations of algorithms is an emerging area of machine learning, seeking to bridge concepts from neural networks with classical algorithms.

251 Jan 5, 2023

🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"

SGLKT-VisDial Pytorch Implementation for the paper: Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer Gi-Cheon Kang, Junseok P

9 Jul 5, 2022

Comments

Some logical inconsistency in the first example of the val_realfp.json ?

Really great work!

There seems to be a small logical inconsistency in the first example in the valid set of RealFP. https://github.com/allenai/fermi/blob/a174bd1e7ddd5f707ef64b4a94b7c2278dea14aa/data/realFP/val_realfp.json#L1

Since Q3: What is the population density?, the following two statements Q3 -> Div (Q1, Q2), P: Pow(Q3, Q4)

should be converted to Q3 → Div (Q2, Q1), P: Pow(1/Q3, Q4)

as [Q1]=m**2 and [Q2]=# of people?

Or, more simply, maybe Q3 can be modified to something like "What is the inverse of the population density" I guess.

Thanks!

opened by whwang299 0

Fermi Problems: A New Reasoning Challenge for AI

Related tags

Overview

Fermi Problems: A New Reasoning Challenge for AI

Inference

You might also like...

Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"

Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"

Database Reasoning Over Text project for ACL paper

Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)

The CLRS Algorithmic Reasoning Benchmark

🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"

Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).

Phy-Q: A Benchmark for Physical Reasoning

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

Comments

Some logical inconsistency in the first example of the val_realfp.json ?

Owner

AI2

Code for the AAAI-2022 paper: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

ManiSkill-Learn is a framework for training agents on SAPIEN Open-Source Manipulation Skill Challenge (ManiSkill Challenge), a large-scale learning-from-demonstrations benchmark for object manipulation.

The all new way to turn your boring vector meshes into the new fad in town; Voxels!

The code of “Similarity Reasoning and Filtration for Image-Text Matching” [AAAI2021]

Code for "FGR: Frustum-Aware Geometric Reasoning for Weakly Supervised 3D Vehicle Detection", ICRA 2021

Open-Ended Commonsense Reasoning (NAACL 2021)

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Deep Learning and Logical Reasoning from Data and Knowledge

[CVPR 2021] A Peek Into the Reasoning of Neural Networks: Interpreting with Structural Visual Concepts

PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"