The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

Last update: Dec 24, 2022

Related tags

Deep Learning FMFCC-A

Overview

FMFCC-A

This project is the description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

The FMFCC-A dataset is shared through BaiduCloud (website: https://pan.baidu.com/s/1CGPkC8VfjXVBZjluEHsW6g , password: IIES). The FMFCC-A dataset is by far the largest publicly available Mandarin dataset for synthetic speech detection, which contains 40,000 synthesized Mandarin utterances that generated by 11 Mandarin TTS systems and two Mandarin VC systems, and 10,000 genuine Mandarin utterances collected from 58 speakers. In addition, the official website of FMFCC-A (Audio track of the first fake media forensic challenge of China Society of Image and Graphics) is http://fmfcc.net/ . We hope that the FMFCC-A dataset can fill the gap of lack of Mandarin datasets for synthetic speech detection under various audio post-processing operations.

If you find the code or dataset is usefull, please cite the following papers: FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection

Comments

speaker tag

Dear author,

Thank your very much for the work on FMFCC-A database! Very useful to the community.

Could you provide the speaker tag like the tag file of asvspoof2019？

Thank you!

opened by hello-xiaow 2
数据集细节缺失

Dear author，

Thanks for your works in FMFCC-A dataset constructing.

However, I found that details (eg. spoof algorithm, post-processing) corresponding to each audio file are not marked.

I wonder if it is possible for you to publish more details about the dataset？

Thank you~

opened by petrichorwq 2
About the train/dev/eval protocols for FMFCC-A database

Dear author,

Thank your very much for the work on FMFCC-A database! Very useful to the community.

I've downloaded the database following the link on baidupan. After unrar the file, I noticed that there are wave files and a FMFCC-AdatasetLabel.txt file (filename, 0/1)

Do you consider releasing the protocols for the training, development, eval sets?

Thank you!

opened by TonyWangX 2

This is the official repo for TransFill: Reference-guided Image Inpainting by Merging Multiple Color and Spatial Transformations at CVPR'21. According to some product reasons, we are not planning to release the training/testing codes and models. However, we will release the dataset and the scripts to prepare the dataset.

TransFill-Reference-Inpainting This is the official repo for TransFill: Reference-guided Image Inpainting by Merging Multiple Color and Spatial Transf

80 Dec 8, 2022

Official Implementation and Dataset of "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency", CVPR 2021

Portrait Photo Retouching with PPR10K Paper | Supplementary Material PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask an

184 Dec 11, 2022

This is the dataset and code release of the OpenRooms Dataset.

95 Jan 8, 2023

A large dataset of 100k Google Satellite and matching Map images, resembling pix2pix's Google Maps dataset.

Larger Google Sat2Map dataset This dataset extends the aerial ⟷ Maps dataset used in pix2pix (Isola et al., CVPR17). The provide script download_sat2m

34 Dec 28, 2022

This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes dataset. This code is implemented in Pytorch.

SLATE This is the official source code for SLATE. We provide the code for the model, the training code and a dataset loader for the 3D Shapes dataset.

66 Dec 26, 2022

Dataset used in "PlantDoc: A Dataset for Visual Plant Disease Detection" accepted in CODS-COMAD 2020

PlantDoc: A Dataset for Visual Plant Disease Detection This repository contains the Cropped-PlantDoc dataset used for benchmarking classification mode

109 Dec 29, 2022

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

MADE (Multi-Adapter Dataset Experts) This repository contains the implementation of MADE (Multi-adapter dataset experts), which is described in the pa

68 Jul 18, 2022

39 Oct 5, 2021

The Habitat-Matterport 3D Research Dataset - the largest-ever dataset of 3D indoor spaces.

Habitat-Matterport 3D Dataset (HM3D) The Habitat-Matterport 3D Research Dataset is the largest-ever dataset of 3D indoor spaces. It consists of 1,000

62 Dec 27, 2022

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

Related tags

Overview

FMFCC-A

You might also like...

Official Implementation and Dataset of "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency", CVPR 2021

This is the dataset and code release of the OpenRooms Dataset.

A large dataset of 100k Google Satellite and matching Map images, resembling pix2pix's Google Maps dataset.

This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes dataset. This code is implemented in Pytorch.

Dataset used in "PlantDoc: A Dataset for Visual Plant Disease Detection" accepted in CODS-COMAD 2020

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

The Habitat-Matterport 3D Research Dataset - the largest-ever dataset of 3D indoor spaces.

Comments

speaker tag

数据集细节缺失

About the train/dev/eval protocols for FMFCC-A database

Owner

AI grand challenge 2020 Repo (Speech Recognition Track)

🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)

Tracking code for the winner of track 1 in the MMP-Tracking Challenge at ICCV 2021 Workshop.

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

ManiSkill-Learn is a framework for training agents on SAPIEN Open-Source Manipulation Skill Challenge (ManiSkill Challenge), a large-scale learning-from-demonstrations benchmark for object manipulation.

Joint deep network for feature line detection and description

Code and description for my BSc Project, September 2021

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset.

Facestar dataset. High quality audio-visual recordings of human conversational speech.