EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

Yassir BENDOU

Last update: Dec 26, 2022

Related tags

Deep Learning easy

Overview

EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

This repository is the official implementation of EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

EASY proposes a simple methodology, that reaches or even beats state of the art performance on multiple standardized benchmarks of the field, while adding almost no hyperparameters or parameters to those used for training the initial deep learning models on the generic dataset.

Downloads

Please click the Google Drive link for downloading the features, backbones and datasets.

Each of the files (backbones and features) have the following prefixes depending on the backbone:

Backbone	prefix	Number of parameters
ResNet12		12M
ResNet12(1/sqrt(2))	small	6M
ResNet12(1/2)	tiny	3M

Each of the features file is named as follow :

if not AS : " features .pt11"
if AS : " featuresAS .pt11"

Testing scripts for EASY

Run scripts to evaluate the features on FSL tasks for Y and ASY. For EY and EASY use the corresponding features.

Inductive setup using NCM

Test features on miniimagenet using Y (Resnet12)

" --dataset miniimagenet --model resnet12 --test-features ' /minifeatures1.pt11' --preprocessing ME">

$ python main.py --dataset-path "
     
      " --dataset miniimagenet --model resnet12 --test-features '
      
       /minifeatures1.pt11' --preprocessing ME

Test features on miniimagenet using ASY (Resnet12)

" --dataset miniimagenet --model resnet12 --test-features ' /minifeaturesAS1.pt11' --preprocessing ME">

$ python main.py --dataset-path "
     
      " --dataset miniimagenet --model resnet12 --test-features '
      
       /minifeaturesAS1.pt11' --preprocessing ME

Test features on miniimagenet using EY (3xResNet12)

" --dataset miniimagenet --model resnet12 --test-features "[ /minifeatures1.pt11, /minifeatures2.pt11, /minifeatures3.pt11]" --preprocessing ME">

$ python main.py --dataset-path "
       
        " --dataset miniimagenet --model resnet12 --test-features "[
        
         /minifeatures1.pt11, 
         
          /minifeatures2.pt11, 
          
           /minifeatures3.pt11]" --preprocessing ME

Test features on miniimagenet using EASY (3xResNet12)

" --dataset miniimagenet --model resnet12 --test-features "[ /minifeaturesAS1.pt11, /minifeaturesAS2.pt11, /minifeaturesAS3.pt11]" --preprocessing ME ">

$ python main.py --dataset-path "
       
        " --dataset miniimagenet --model resnet12 --test-features "[
        
         /minifeaturesAS1.pt11, 
         
          /minifeaturesAS2.pt11, 
          
           /minifeaturesAS3.pt11]" --preprocessing ME

Transductive setup using Soft k-means

Test features on miniimagenet using Y (ResNet12)

" --dataset miniimagenet --model resnet12 --test-features ' /minifeatures1.pt11'--postprocessing ME --transductive --transductive-softkmeans --transductive-temperature-softkmeans 20">

$ python main.py --dataset-path "
     
      " --dataset miniimagenet --model resnet12 --test-features '
      
       /minifeatures1.pt11'--postprocessing ME --transductive --transductive-softkmeans --transductive-temperature-softkmeans 20

Test features on miniimagenet using ASY (ResNet12)

" --dataset miniimagenet --model resnet12 --test-features ' /minifeaturesAS1.pt11' --postprocessing ME --transductive --transductive-softkmeans --transductive-temperature-softkmeans 20">

$ python main.py --dataset-path "
     
      " --dataset miniimagenet --model resnet12 --test-features '
      
       /minifeaturesAS1.pt11' --postprocessing ME --transductive --transductive-softkmeans --transductive-temperature-softkmeans 20

Test features on miniimagenet using EY (3xResNet12)

" --dataset miniimagenet --model resnet12 --test-features "[ /minifeatures1.pt11, /minifeatures2.pt11, /minifeatures3.pt11]" --postrocessing ME --transductive --transductive-softkmeans --transductive-temperature-softkmeans 20">

$ python main.py --dataset-path "
       
        " --dataset miniimagenet --model resnet12 --test-features "[
        
         /minifeatures1.pt11, 
         
          /minifeatures2.pt11, 
          
           /minifeatures3.pt11]" --postrocessing ME  --transductive --transductive-softkmeans --transductive-temperature-softkmeans 20

Test features on miniimagenet using EASY (3xResNet12)

" --dataset miniimagenet --model resnet12 --test-features "[ /minifeaturesAS1.pt11, /minifeaturesAS2.pt11, /minifeaturesAS3.pt11]" --postrocessing ME --transductive --transductive-softkmeans --transductive-temperature-softkmeans 20">

$ python main.py --dataset-path "
       
        " --dataset miniimagenet --model resnet12 --test-features "[
        
         /minifeaturesAS1.pt11, 
         
          /minifeaturesAS2.pt11, 
          
           /minifeaturesAS3.pt11]" --postrocessing ME  --transductive --transductive-softkmeans --transductive-temperature-softkmeans 20

Training scripts for Y

Train a model on miniimagenet using manifold mixup, self-supervision and cosine scheduler

" --dataset miniimagenet --model resnet12 --epochs 0 --manifold-mixup 500 --rotations --cosine --gamma 0.9 --milestones 100 --batch-size 128 --preprocessing ME ">

$ python main.py --dataset-path "
    
     " --dataset miniimagenet --model resnet12 --epochs 0 --manifold-mixup 500 --rotations --cosine --gamma 0.9 --milestones 100 --batch-size 128 --preprocessing ME

Important Arguments

Some important arguments for our code.

Training arguments

dataset: choices=['miniimagenet', 'cubfs','tieredimagenet', 'fc100', 'cifarfs']
model: choices=['resnet12', 'resnet18', 'resnet20', 'wideresnet', 's2m2r']
dataset-path: path of the datasets folder which contains folders of all the datasets.

Few-shot Classification

preprocessing: preprocessing sequence for few shot given as a string, can contain R:relu P:sqrt E:sphering and M:centering using the base data.
postprocessing: postprocessing sequence for few shot given as a string, can contain R:relu P:sqrt E:sphering and M:centering on the few-shot data, used for transductive setting.

Few-shot classification Results

Experimental results on few-shot learning datasets with ResNet-12 backbone. We report our average results with 10000 randomly sampled episodes for both 1-shot and 5-shot evaluations.

MiniImageNet Dataset (inductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
SimpleShot [29]	62.85 ± 0.20	80.02 ± 0.14
Baseline++ [30]	53.97 ± 0.79	75.90 ± 0.61
TADAM [35]	58.50 ± 0.30	76.70 ± 0.30
ProtoNet [10]	60.37 ± 0.83	78.02 ± 0.57
R2-D2 (+ens) [20]	64.79 ± 0.45	81.08 ± 0.32
FEAT [36]	66.78	82.05
CNL [37]	67.96 ± 0.98	83.36 ± 0.51
MERL [38]	67.40 ± 0.43	83.40 ± 0.28
Deep EMD v2 [13]	68.77 ± 0.29	84.13 ± 0.53
PAL [8]	69.37 ± 0.64	84.40 ± 0.44
inv-equ [39]	67.28 ± 0.80	84.78 ± 0.50
CSEI [40]	68.94 ± 0.28	85.07 ± 0.50
COSOC [9]	69.28 ± 0.49	85.16 ± 0.42
EASY 2×ResNet12 1/√2 (ours)	70.63 ± 0.20	86.28 ± 0.12
above <=12M	nb of parameters	below 36M
3S2M2R [12]	64.93 ± 0.18	83.18 ± 0.11
LR + DC [17]	68.55 ± 0.55	82.88 ± 0.42
EASY 3×ResNet12 (ours)	71.75 ± 0.19	87.15 ± 0.12

TieredImageNet Dataset (inductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
SimpleShot [29]	69.09 ± 0.22	84.58 ± 0.16
ProtoNet [10]	65.65 ± 0.92	83.40 ± 0.65
FEAT [36]	70.80 ± 0.23	84.79 ± 0.16
PAL [8]	72.25 ± 0.72	86.95 ± 0.47
DeepEMD v2 [13]	74.29 ± 0.32	86.98 ± 0.60
MERL [38]	72.14 ± 0.51	87.01 ± 0.35
COSOC [9]	73.57 ± 0.43	87.57 ± 0.10
CNL [37]	73.42 ± 0.95	87.72 ± 0.75
invariance-equivariance [39]	72.21 ± 0.90	87.08 ± 0.58
CSEI [40]	73.76 ± 0.32	87.83 ± 0.59
ASY ResNet12 (ours)	74.31 ± 0.22	87.86 ± 0.15
above <=12M	nb of parameters	below 36M
S2M2R [12]	73.71 ± 0.22	88.52 ± 0.14
EASY 3×ResNet12 (ours)	74.71 ± 0.22	88.33 ± 0.14

CUBFS Dataset (inductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
FEAT [36]	68.87 ± 0.22	82.90 ± 0.10
LaplacianShot [41]	80.96	88.68
ProtoNet [10]	66.09 ± 0.92	82.50 ± 0.58
DeepEMD v2 [13]	79.27 ± 0.29	89.80 ± 0.51
EASY 4×ResNet12 1/sqrt(2)	77.97 ± 0.20	91.59 ± 0.10
above <=12M	nb of parameters	below 36M
S2M2R [12]	80.68 ± 0.81	90.85 ± 0.44
EASY 3×ResNet12 (ours)	78.56 ± 0.19	91.93 ± 0.10

CIFAR-FS Dataset (inductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
S2M2R [12]	63.66 ± 0.17	76.07 ± 0.19
R2-D2 (+ens) [20]	76.51 ± 0.47	87.63 ± 0.34
invariance-equivariance [39]	77.87 ± 0.85	89.74 ± 0.57
EASY 2×ResNet12 1/sqrt(2) (ours)	75.24 ± 0.20	88.38 ± 0.14
above <=12M	nb of parameters	below 36M
S2M2R [12]	74.81 ± 0.19	87.47 ± 0.13
EASY 3×ResNet12 (ours)	76.20 ± 0.20	89.00 ± 0.14

FC-100 Dataset (inductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
DeepEMD v2 [13]	46.60 ± 0.26	63.22 ± 0.71
TADAM [35]	40.10 ± 0.40	56.10 ± 0.40
ProtoNet [10]	41.54 ± 0.76	57.08 ± 0.76
invariance-equivariance [39]	47.76 ± 0.77	65.30 ± 0.76
R2-D2 (+ens) [20]	44.75 ± 0.43	59.94 ± 0.41
EASY 2×ResNet12 1/sqrt(2) (ours)	47.94 ± 0.19	64.14 ± 0.19
above <=12M	nb of parameters	below 36M
EASY 3×ResNet12 (ours)	48.07 ± 0.19	64.74 ± 0.19

Minimagenet (transductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
TIM-GD [42]	73.90	85.00
ODC [43]	77.20 ± 0.36	87.11 ± 0.42
PEMnE-BMS∗ [32]	80.56 ± 0.27	87.98 ± 0.14
SSR [44]	68.10 ± 0.60	76.90 ± 0.40
iLPC [45]	69.79 ± 0.99	79.82 ± 0.55
EPNet [31]	66.50 ± 0.89	81.60 ± 0.60
DPGN [46]	67.77 ± 0.32	84.60 ± 0.43
ECKPN [47]	70.48 ± 0.38	85.42 ± 0.46
Rot+KD+POODLE [48]	77.56	85.81
EASY 2×ResNet12( 1√2) (ours)	81.70 ±0.25	88.29 ±0.13
above <=12M	nb of parameters	below 36M
SSR [44]	72.40 ± 0.60	80.20 ± 0.40
fine-tuning(train+val) [49]	68.11 ± 0.69	80.36 ± 0.50
SIB+E3BM [50]	71.40	81.20
LR+DC [17]	68.57 ± 0.55	82.88 ± 0.42
EPNet [31]	70.74 ± 0.85	84.34 ± 0.53
TIM-GD [42]	77.80	87.40
PT+MAP [51]	82.92 ± 0.26	88.82 ± 0.13
iLPC [45]	83.05 ± 0.79	88.82 ± 0.42
ODC [43]	80.64 ± 0.34	89.39 ± 0.39
PEMnE-BMS∗ [32]	83.35 ± 0.25	89.53 ± 0.13
EASY 3×ResNet12 (ours)	82.75 ±0.25	88.93 ±0.12

CUB-FS (transductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
TIM-GD [42]	82.20	90.80
ODC [43]	85.87	94.97
DPGN [46]	75.71 ± 0.47	91.48 ± 0.33
ECKPN [47]	77.43 ± 0.54	92.21 ± 0.41
iLPC [45]	89.00 ± 0.70	92.74 ± 0.35
Rot+KD+POODLE [48]	89.93	93.78
EASY 4×ResNet12( 1/2) (ours)	90.41 ± 0.19	93.58 ± 0.10
above <=12M	nb of parameters	below 36M
LR+DC [17]	79.56 ± 0.87	90.67 ± 0.35
PT+MAP [51]	91.55 ± 0.19	93.99 ± 0.10
iLPC [45]	91.03 ± 0.63	94.11 ± 0.30
EASY 3×ResNet12 (ours)	90.76 ± 0.19	93.90 ± 0.09

CIFAR-FS (transductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
SSR [44]	76.80 ± 0.60	83.70 ± 0.40
iLPC [45]	77.14 ± 0.95	85.23 ± 0.55
DPGN [46]	77.90 ± 0.50	90.02 ± 0.40
ECKPN [47]	79.20 ± 0.40	91.00 ± 0.50
EASY 2×ResNet12 (1/sqrt(2)) (ours)	86.40 ± 0.23	89.75 ± 0.15
above <=12M	nb of parameters	below 36M
SSR [44]	81.60 ± 0.60	86.00 ± 0.40
fine-tuning (train+val) [49]	78.36 ± 0.70	87.54 ± 0.49
iLPC [45]	86.51 ± 0.75	90.60 ± 0.48
PT+MAP [51]	87.69 ± 0.23	90.68 ± 0.15
EASY 3×ResNet12 (ours)	86.96 ± 0.22	90.30 ± 0.15

FC-100 (transductive)

Methods	1-Shot 5-Way	5-Shot 5-Way
EASY 2×ResNet12( 1√2)(ours)	54.68 ± 0.25	66.19 ± 0.20
above <=12M	nb of parameters	below 36M
SIB+E3BM [50]	46.00	57.10
fine-tuning (train) [49]	43.16 ± 0.59	57.57 ± 0.55
ODC [43]	47.18 ± 0.30	59.21 ± 0.56
fine-tuning (train+val) [49]	50.44 ± 0.68	65.74 ± 0.60
EASY 3×ResNet12 (ours)	55.11 ± 0.25	67.09 ± 0.20

Tiered Imagenet (transducive)

Methods	1-Shot 5-Way	5-Shot 5-Way
PT+MAP [51]	85.67 ± 0.26	90.45 ± 0.14
TIM-GD [42]	79.90	88.50
ODC [43]	83.73 ± 0.36	90.46 ± 0.46
SSR [44]	81.20 ± 0.60	85.70 ± 0.40
Rot+KD+POODLE [48]	79.67	86.96
DPGN [46]	72.45 ± 0.51	87.24 ± 0.39
EPNet [31]	76.53 ± 0.87	87.32 ± 0.64
ECKPN [47]	73.59 ± 0.45	88.13 ± 0.28
iLPC [45]	83.49 ± 0.88	89.48 ± 0.47
ASY ResNet12 (ours)	82.66 ± 0.27	88.60 ± 0.14
above <=12M	nb of parameters	below 36M
SIB+E3BM [50]	75.60	84.30
SSR [44]	79.50 ± 0.60	84.80 ± 0.40
fine-tuning (train+val) [49]	72.87 ± 0.71	86.15 ± 0.50
TIM-GD [42]	82.10	89.80
LR+DC [17]	78.19 ± 0.25	89.90 ± 0.41
EPNet [31]	78.50 ± 0.91	88.36 ± 0.57
ODC [43]	85.22 ± 0.34	91.35 ± 0.42
iLPC [45]	88.50 ± 0.75	92.46 ± 0.42
PEMnE-BMS∗ [32]	86.07 ± 0.25	91.09 ± 0.14
EASY 3×ResNet12 (ours)	84.48 ± 0.27	89.71 ± 0.14

Comments

Argument setting for training other datasets

Sorry to bother again. I wonder if you can provide the argument setting for reproducing outcomes of the other datasets like CIFAR-FS or Tiered-ImageNet. Hope this won't bring you much trouble. Thx a lot.

opened by LIUZIJING-CHN 9
Questions about implementation.
Hello, thank you very much for your work, I have some questions.

I noticed that you added a argument "batch_fs_size" to args.py and its default value is set to 20, does this argument affect the test performance? From my understanding, it seems to affect only the GPU memory usage and speed during testing.

How to reproduce the results on CUB and tiered-ImageNet?

According to the command you gave for mini-ImageNet, the computation and back-propagation of a batch's loss is performed twice, where the first time is the classification loss with manifold mixup and the second time is the original classification loss and the rotation loss, i.e., a batch contains two gradient updates. Why does it need to be done in this way? Does this mean that the training of 500 epochs is approximately equivalent to the training of 1000 epochs for the methods with one-time gradient update for a batch? I personally tried to perform the manifold mixup classification and the roation loss together, (according to the code: if mixup and args.mm: ......) https://github.com/ybendou/easy/blob/85937e0d2d67a801dba7a96974a79c2d6cad86b7/main.py#L84-L96 but this resulted in a slight performance degradation, is this because there is a potential conflict between manifold mixup and rotation?

Have you studied the compatibility of the different augmentation methods with the self-supervised rotation loss? For example, does random horizontal flipping break the rotation loss? I'd like to hear your opinion.

Sorry for my poor English, I hope I expressed my question clearly. Best,
opened by whyygug 4
cannot reproduce the performance of 3xResNet12

Excuse me, I have followed the command of training a ResNet12 with the followings: python main.py --dataset-path ' ' --dataset miniimagenet --model resnet12 --epochs 0 --manifold-mixup 500 --rotations --cosine --gamma 0.9 --milestones 100 --skip-epochs 450 --batch-size 128 --preprocessing ME --save-model "result/official/mini1.pt" --n-shot [1,5] which is the same as the one you provided. But I can only reach the accuracy of 64 of each backbone, and after I ensembled the features the accuracy can only reach 65. The features I downloaded from OneDrive can have an accuracy of around 70 for each, which is far beyond mine. Hope you can solve my problem, thx!

opened by LIUZIJING-CHN 3
Training on tieredImageNet leads to CUDA out of memory.

Hello,

In this issue https://github.com/ybendou/easy/issues/11#issuecomment-1116237889 you mentioned changing the argument dataset to ‘tieredimagenet’ can reproduce the results on tieredImageNet.
My device is an RTX 3090 with 24GB memory. Even if I reduce the batch size from 128 to 64, there is still the problem of out of GPU memory during training. I noticed that the memory usage is only about 6GB during training, but once the training of a epoch is finished and the validation starts, the GPU memory is immediately exhausted. What should I do to complete the training on tieredImageNet?

Best,

opened by whyygug 3
How to test features on tieredimagenet

Hello,

May I know how to run test for tieredimagenet using ASY? I used the following command line, but it seems not working for multiple GPU. python main.py --dataset-path 'my data path' --dataset tieredimagenet --model resnet12 --test-features tieredfeaturesAS1.pt11 --preprocessing ME --n-shots 1 --device cuda:0123 If I just used --device cuda:0, I got out of memory error as the feature is too big for tieredimagenet. I am using 4 2080ti GPU with 12GB memory per device.

opened by Nan-S 3
Saving and deploying the Few-Shot model

Are we also saving the few-shot head here?

I am concerned with deploying the model in production. Looks like we can't just do a normal prediction on the loaded model but need to take care of the logic of few_shot_eval.eval_few_shot?

opened by rlleshi 2
Reproduce the outcome of 2x_ResNet12_√2

First of all, thx for your work! Since I want to reproduce the outcome of 2x_ResNet12_√2 which has an accuracy of 70 in 1-shot case on mini-ImageNet, I want to know what is the training setting of the main.py? I want to extract the features from my own-trained model, hope you can help me.

opened by LIUZIJING-CHN 2
about the feature

Hello, I have a question about the feature.

Why do the inductive and transductive use the same feature?

I thought they were using different ways to train cause the testing set had only been used in transductive learning, not inductive learning.

Thanks.

opened by 96091165 2
About GPU memory

Hi,

I try to test features on cifarfs by this command: """ $ python main.py --dataset-path "" --dataset cifarfs --model resnet12 --test-features "[/AS1.pt11, /AS2.pt11, /AS3.pt11]" --preprocessing ME --n-shots 1 """ and it output "RuntimeError: CUDA out of memory".

I am using 3080 GPU with 8GB memory. I also tried colab and got the same problem. May I know how to fix it?

Thanks

opened by jerryshih1106 2
Some questions about the code.
Hi, your work is wonderful! Thank you for the detailed and neat code! I have some questions about the code.

About the network:

I'm a bit confused about the network structure of ResNet12, it seems to have 3 convolutional layers in each BasicBlock? But the official ResNet18 provided by Pytorch has only two convolutional layers in each BasicBlock.

Why is the MaxPooling operation in ResNet12 performed outside the BasicBlock and not inside the BasicBlock as in ResNet18?

ResNet18 has a total of 17 convolutional layers and one FC layer, so it contains a total of 18 layers with trainable weights. But ResNet12 in this code has 12 convolutional layers and one FC layer, that is, it has 12 layers with trainable weights, should it be called ResNet13?

The number of parameters in ResNet12 (12.4M) is even more than that of ResNet18 (11.2M), which makes me confused. Why do we design a network that seems lighter from the name but is actually heavier? I replaced ResNet12 with ResNet18 for training and found that the performance dropped.

About training and testing:

When centering the feature vectors, each vector should minus an average feature vector. The paper says that “\overline{z} is the average feature vector of the base dataset if in inductive setting or of the few-shot considered problem if in transductive setting.” But the code seems to show that \overline{z} is still used as the average vector of the base dataset in transductive setting?

If I want to reproduce a Y ResNet12(√2) model, should the script be “python main.py --dataset-path "" --dataset miniimagenet --model resnet12 --epochs 0 --manifold-mixup 500 --rotations --cosine --gamma 0.9 --milestones 100 --batch-size 128 --preprocessing ME --feature_maps 45”

Why set forceCPU=True for val_loader and novel_loader?

What does the .pt55 suffix file represent in the pre-trained weights and features provided? I tried to test the features in minifeatures1.pt55 and found its performance inferior to minifeatures1.pt11. By the way, can Pytorch read and write files in .pt11 or .pt55 format? I know the common formats include .pt, .pkl, and .pth, but what format is .pt11?
opened by whyygug 2
Training with multiples datasets

Hi,

Great paper ! I have a question regarding the training procedure with multiples datasets. If I have lets say 6 datasets with 5 class each, with the old training procedure of MAML or ProtoNet, where the model is trained with episodes and tasks, this seems simple enough since each data set would be a task. With your training procedure (which seems to be used in all recent papers), how would you do it ? Concatenate all the classes to have 30 classes and train with batches made of samples of all datasets ?

Thanks a lot if you have the time to answer :)

opened by Gasp34 1

Owner

Yassir BENDOU

Ph.D student working on Few-shot learning problems. I enjoy maths and coding.

GitHub

tsai is an open-source deep learning package built on top of Pytorch & fastai focused on state-of-the-art techniques for time series classification, regression and forecasting.

Time series Timeseries Deep Learning Pytorch fastai - State-of-the-art Deep Learning with Time Series and Sequences in Pytorch / fastai

2.8k Jan 8, 2023

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

152 Jan 2, 2023

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

T-Few This repository contains the official code for the paper: "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learni

220 Dec 31, 2022

Quickly comparing your image classification models with the state-of-the-art models (such as DenseNet, ResNet, ...)

Image Classification Project Killer in PyTorch This repo is designed for those who want to start their experiments two days before the deadline and ki

349 Dec 8, 2022

The repo for the paper "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection".

33 Jan 5, 2023

[NeurIPS2021] Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks

Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks Code for NeurIPS 2021 Paper "Exploring Architectural Ingredients of A

26 Dec 1, 2022

Few-NERD: Not Only a Few-shot NER Dataset

Few-NERD: Not Only a Few-shot NER Dataset This is the source code of the ACL-IJCNLP 2021 paper: Few-NERD: A Few-shot Named Entity Recognition Dataset.

319 Dec 30, 2022

Library of various Few-Shot Learning frameworks for text classification

FewShotText This repository contains code for the paper A Neural Few-Shot Text Classification Reality Check Environment setup # Create environment pyt

47 Jan 3, 2023

Spatial Contrastive Learning for Few-Shot Classification (SCL)

This repo contains the official implementation of Spatial Contrastive Learning for Few-Shot Classification (SCL), which presents of a novel contrastive learning method applied to few-shot image classification in order to learn more general purpose embeddings, and facilitate the test-time adaptation to novel visual categories.

34 Dec 25, 2022

render sprites into your desktop environment as shaped windows using GTK

spritegtk render static or animated sprites into your desktop environment as dynamic shaped windows using GTK requires pycairo and PYGobject: pip inst

20 Oct 27, 2022

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped

CSWin-Transformer This repo is the official implementation of "CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows". Th

409 Jan 6, 2023

An official repository for Paper "Uformer: A General U-Shaped Transformer for Image Restoration".

Uformer: A General U-Shaped Transformer for Image Restoration Zhendong Wang, Xiaodong Cun, Jianmin Bao and Jianzhuang Liu Paper: https://arxiv.org/abs

497 Dec 22, 2022

A unofficial pytorch implementation of PAN(PSENet2): Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network Requirements pytorch 1.1+ torchvision 0.3+ pyclipper opencv3 gcc

400 Dec 26, 2022

deep-table implements various state-of-the-art deep learning and self-supervised learning algorithms for tabular data using PyTorch.

63 Oct 17, 2022

EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

Related tags

Overview

EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

Downloads

Testing scripts for EASY

Inductive setup using NCM

Transductive setup using Soft k-means

Training scripts for Y

Important Arguments

Few-shot classification Results

Comments

Owner

Yassir BENDOU

tsai is an open-source deep learning package built on top of Pytorch & fastai focused on state-of-the-art techniques for time series classification, regression and forecasting.

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Quickly comparing your image classification models with the state-of-the-art models (such as DenseNet, ResNet, ...)

The repo for the paper "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection".

[NeurIPS2021] Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks

Few-NERD: Not Only a Few-shot NER Dataset

Library of various Few-Shot Learning frameworks for text classification

Spatial Contrastive Learning for Few-Shot Classification (SCL)

render sprites into your desktop environment as shaped windows using GTK

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped

An official repository for Paper "Uformer: A General U-Shaped Transformer for Image Restoration".

A unofficial pytorch implementation of PAN(PSENet2): Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

deep-table implements various state-of-the-art deep learning and self-supervised learning algorithms for tabular data using PyTorch.

Ready-to-use code and tutorial notebooks to boost your way into few-shot image classification.

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"

TransPrompt - Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification

vit for few-shot classification