Overview

SERank

An efficient and effective learning-to-rank algorithm that mines information across ranking candidates. This repository contains the TensorFlow implementation of the SERank model. The code is developed on top of TF-Ranking.

Compared with GSF (Groupwise Scoring Function), our method obtains a comparable ranking performance gain while requiring only a small computation overhead.

The SERank model has been successfully deployed in Zhihu Search ranking; Zhihu is one of the largest community question-answering platforms in China.


Dependencies
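TensorFlow and TF-Ranking; as noted above, the implementation is built on top of the TF-Ranking library.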

Dataset

The demo dataset in this repo is randomly sampled from the MSLR-WEB30K dataset. You may download the full WEB30K dataset from the Microsoft Learning to Rank Datasets page and place train.txt, vali.txt, and test.txt in the data folder.
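
For reference, WEB30K files use the LETOR/LibSVM text format: each line is one query-document pair, with a graded relevance label (0-4), a query id, and 136 numbered features, matching --num_features=136 in the training flags quoted below. The feature values here are made up for illustration:

    2 qid:10 1:0.031310 2:0.666667 ... 136:0.077670
    0 qid:10 1:0.078682 2:0.166667 ... 136:0.013574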

Model

Our main idea is to develop a sequencewise model structure that accepts a list of ranking candidates and jointly scores all of them. We introduce the SENet structure into the ranking model; the basic idea is to use SENet to compute feature importance according to the context of the ranking list.
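
As a rough sketch of this idea (not the repository's exact code), the block below applies squeeze-and-excitation across a ranking list: it pools each feature over all candidates, passes the summary through a small bottleneck MLP, and re-weights every candidate's features with the resulting importance scores. The name se_block and the reduction ratio are illustrative assumptions:

    import tensorflow as tf

    def se_block(candidates, reduction=4):
        """Squeeze-and-Excitation over a ranking list (illustrative sketch).

        candidates: float tensor of shape [batch, list_size, num_features].
        Returns a tensor of the same shape, with each feature re-weighted
        by an importance score computed from the whole candidate list.
        """
        num_features = candidates.shape[-1]
        # Squeeze: summarize each feature across all candidates in the list.
        squeezed = tf.reduce_mean(candidates, axis=1)  # [batch, num_features]
        # Excitation: a bottleneck MLP produces per-feature weights in (0, 1).
        hidden = tf.keras.layers.Dense(num_features // reduction,
                                       activation='relu')(squeezed)
        weights = tf.keras.layers.Dense(num_features,
                                        activation='sigmoid')(hidden)
        # Re-scale: apply the same weights to every candidate, so each
        # candidate's representation reflects the context of the full list.
        return candidates * tf.expand_dims(weights, axis=1)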

For details about the model structure, please refer to our paper on arXiv.

How to Train

bash run_train.sh
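
The script presumably wraps serank.py; the run script quoted in the comment thread below lists the available flags (--train_path, --vali_path, --test_path, --output_dir, --num_features, --serank, --query_label_weight).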

Citation

SERank: Optimize Sequencewise Learning to Rank Using Squeeze-and-Excitation Network. [arXiv]

@article{wang2020serank,
  title={SERank: Optimize Sequencewise Learning to Rank Using Squeeze-and-Excitation Network},
  author={Wang, RuiXing and Fang, Kuan and Zhou, RiKang and Shen, Zhan and Fan, LiWen},
  journal={arXiv preprint arXiv:2006.04084},
  year={2020}
}
You might also like...
A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results

Bag of tricks for long-tailed visual recognition with deep convolutional neural networks This repository is the official PyTorch implementation of AAA

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars, CVPR 2022.

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars This repository is the official implementation of Colar. In this work,

ICCV2021 - Mining Contextual Information Beyond Image for Semantic Segmentation

Introduction The official repository for "Mining Contextual Information Beyond Image for Semantic Segmentation". Our full code has been merged into ss

This repository contains the source code of our work on designing efficient CNNs for computer vision

Efficient networks for Computer Vision This repo contains source code of our work on designing efficient networks for different computer vision tasks:

Source Code for DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances (https://arxiv.org/pdf/2012.01775.pdf)

DialogBERT This is a PyTorch implementation of the DialogBERT model described in DialogBERT: Neural Response Generation via Hierarchical BERT with Dis

An image base contains 490 images for learning (400 cars and 90 boats), and another 21 images for testing

SVM Data: An image base contains 490 images for training (400 cars and 90 boats), and another 21 images for testing. Preprocessing

This repository contains a pytorch implementation of "HeadNeRF: A Real-time NeRF-based Parametric Head Model (CVPR 2022)".

HeadNeRF: A Real-time NeRF-based Parametric Head Model This repository contains a pytorch implementation of "HeadNeRF: A Real-time NeRF-based Parametr

Open source implementation of AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing

AceNAS This repo is the experiment code of AceNAS, and is not considered as an official release. We are working on integrating AceNAS as a built-in st

PyTorch Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning".

Comments
  • Problem reproducing the Web30K experiment

    Hi, thanks for your great work, but I have a problem reproducing the Web30K experiment. I downloaded the code and switched the data to Web30K; the ndcg@5 of the last checkpoint is only 0.409, versus 0.456 in the paper. I did not change serank.py, and the run script is:

    DATA=/path-to-web30k/Fold1
    
    output_dir=outputs
    rm -r $output_dir
    mkdir $output_dir
    
    python serank.py \
      --train_path=$DATA/train.txt \
      --vali_path=$DATA/vali.txt \
      --test_path=$DATA/test.txt \
      --output_dir=$output_dir \
      --num_features=136 \
      --serank=True \
      --query_label_weight=True
    

    The tail of the training log is:

    I0322 18:19:51.462585 140648353728320 evaluation.py:276] Finished evaluation at 2021-03-22-18:19:51
    INFO:tensorflow:Saving dict for global step 100000: global_step = 100000, labels_mean = 0.66818184, logits_mean = -0.27165264, loss = 258.99927, metric/arp = 38.71134, metric/ndcg@1 = 0.38938853, metric/ndcg@10 = 0.43623617, metric/ndcg@3 = 0.39571184, metric/ndcg@5 = 0.40858307, metric/ordered_pair_accuracy = 0.6484862
    I0322 18:19:51.462839 140648353728320 estimator.py:2066] Saving dict for global step 100000: global_step = 100000, labels_mean = 0.66818184, logits_mean = -0.27165264, loss = 258.99927, metric/arp = 38.71134, metric/ndcg@1 = 0.38938853, metric/ndcg@10 = 0.43623617, metric/ndcg@3 = 0.39571184, metric/ndcg@5 = 0.40858307, metric/ordered_pair_accuracy = 0.6484862
    INFO:tensorflow:Saving 'checkpoint_path' summary for global step 100000: outputs/model.ckpt-100000
    I0322 18:19:51.463721 140648353728320 estimator.py:2127] Saving 'checkpoint_path' summary for global step 100000: outputs/model.ckpt-100000
    

    Thanks for your help.

    enhancement 
    opened by stanpcf 4
Owner
Zhihu
Zhihu's official GitHub account. You are welcome to follow our technology column: https://zhuanlan.zhihu.com/hackers
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.

MicRank: Learning to Rank Microphones for Distant Speech Recognition Application Scenario Many applications nowadays envision the presence of multiple

Samuele Cornell 20 Nov 10, 2022
CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection

CLOCs is a novel Camera-LiDAR Object Candidates fusion network. It provides a low-complexity multi-modal fusion framework that improves the performance of single-modality detectors. CLOCs operates on the combined output candidates of any 3D and any 2D detector, and is trained to produce more accurate 3D and 2D detection results.

Su Pang 254 Dec 16, 2022
Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm, Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP (Traveling Salesman)

scikit-opt Swarm Intelligence in Python (Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Algorithm, Immune Algorithm,A

郭飞 3.7k Jan 3, 2023
Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

Bridging Multi-Task Learning and Meta-Learning Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Trainin

AI Secure 57 Dec 15, 2022
Gradient-free global optimization algorithm for multidimensional functions based on the low rank tensor train format

ttopt Description Gradient-free global optimization algorithm for multidimensional functions based on the low rank tensor train (TT) format and maximu

null 5 May 23, 2022
ViDT: An Efficient and Effective Fully Transformer-based Object Detector

ViDT: An Efficient and Effective Fully Transformer-based Object Detector by Hwanjun Song1, Deqing Sun2, Sanghyuk Chun1, Varun Jampani2, Dongyoon Han1,

NAVER AI 262 Dec 27, 2022
This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

MultiModal-InfoMax This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Informa

Deep Cognition and Language Research (DeCLaRe) Lab 89 Dec 26, 2022
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Microsoft 8.4k Jan 1, 2023
A deep learning library that makes face recognition efficient and effective

Distributed Arcface Training in Pytorch This is a deep learning library that makes face recognition efficient, and effective, which can train tens of

Sajjad Aemmi 10 Nov 23, 2021