Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

Related tags

Overview

Deep Vision and Graphics

This repo supplements course "Deep Vision and Graphics" taught at YSDA @fall'21. The course is the successor of "Deep Learning" course taught at YSDA in 2015-2021. New course focuses more on applications of deep learning for computer vision.

Lecture and seminar materials for each week are in ./week* folders. Homeworks are in ./homework* folders.

General info

Telegram chat room (russian).
YSDA deadlines & admin stuff can be found at the YSDA LMS (ysda students only).
Any technical issues, ideas, bugs in course materials, contribution ideas - add an issue

Syllabus

week01 Intro, recap of Neural network basics, optimization, backprop, biological networks
week02 Images, linear filtering, convolutional networks, batchnorms, augmentations
week03 ConvNet architectures and how to find them, sparse convolutions in 3D, ConvNets for videos, transfer learning
week04 Dense prediction: semantic segmentation, superresolution/image synthesis, perceptual losses
week05 Non-convolutional architectures: transformers (some recap of their use in NLP), mixers, FFT convolutions
week06 Visualizing and understanding deep architectures, adversarial examples
week07 Object detection, instance/panoptic segmentation, 2D/3D human pose estimation
week08 Representation learning: face recognition, verification tasks, self-supervised learning, image captioning
week09 Latent models (GLO, AEs, flow models, diffusion models, VQ-VAE, generative transformers, CLIP, DALL-E)
week10 Generative adversarial networks
week11 Shape and motion estimation: spatial transformers, optical flow, stereo, monodepth, point cloud generation, implicit and semi-implicit shape representations
week12 New view synthesis: multi-plane images, neural radiance fields, mesh-based and point-based representations for NVS, neural renderers

Contributors & course staff

Course materials and teaching performed by

Victor Lempitsky - all main track lectures
Victor Yurchenko - seminars, homeworks, admin stuff
Fedor Ratnikov - seminars, homeworks, admin staff
To be continued

You might also like...

General purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends)

General purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.

1k Jan 6, 2023

HashNeRF-pytorch - Pure PyTorch Implementation of NVIDIA paper on Instant Training of Neural Graphics primitives

HashNeRF-pytorch Instant-NGP recently introduced a Multi-resolution Hash Encodin

616 Jan 6, 2023

NPBG++: Accelerating Neural Point-Based Graphics

[CVPR 2022] NPBG++: Accelerating Neural Point-Based Graphics Project Page | Paper This repository contains the official Python implementation of the p

57 Dec 3, 2022

Datasets, Transforms and Models specific to Computer Vision

torchvision The torchvision package consists of popular datasets, model architectures, and common image transformations for computer vision. Installat

13.1k Jan 2, 2023

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

107 Dec 2, 2022

This project demonstrates the use of neural networks and computer vision to create a classifier that interprets the Brazilian Sign Language.

LIBRAS-Image-Classifier This project demonstrates the use of neural networks and computer vision to create a classifier that interprets the Brazilian

26 Oct 14, 2022

[CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang

The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models Codes for this paper The Lottery Tickets Hypo

59 Dec 28, 2022

Scenic: A Jax Library for Computer Vision and Beyond

Scenic Scenic is a codebase with a focus on research around attention-based models for computer vision. Scenic has been successfully used to develop c

1.6k Dec 27, 2022

GluonMM is a library of transformer models for computer vision and multi-modality research

GluonMM is a library of transformer models for computer vision and multi-modality research. It contains reference implementations of widely adopted baseline models and also research work from Amazon Research.

42 Dec 2, 2022

Comments

fix dd_helper_modified

dd_helper_modified выполняла не то, ради чего была задумана. Если на вызвать ее с параметрами layer=0 iterations=0 lr=0 наш альбатрос все равно превратится в веб сайт. На вход функции подавался np.array со значениями в диапазоне 0 1 а на выходе уже нечто со значениями 0 255. Функция predict не устойчива к такому изменению параметров

opened by zhukovaes 1
Adversarial examples

predict(np.array(img_adv))

сетка тут ломается от того что входная картинка приведена к диапозону [0,255], при ожидаемом [0,1]. Если этот поправить, то сломать ее уже ни так просто, не сильно меняя картинку. im = Image.fromarray(np.uint8(input_im * 255))

opened by jenyav94 0

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

Related tags

Overview

Deep Vision and Graphics

General info

Syllabus

Contributors & course staff

You might also like...

General purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends)

HashNeRF-pytorch - Pure PyTorch Implementation of NVIDIA paper on Instant Training of Neural Graphics primitives

NPBG++: Accelerating Neural Point-Based Graphics

Datasets, Transforms and Models specific to Computer Vision

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

This project demonstrates the use of neural networks and computer vision to create a classifier that interprets the Brazilian Sign Language.

[CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang

Scenic: A Jax Library for Computer Vision and Beyond

GluonMM is a library of transformer models for computer vision and multi-modality research

Comments

fix dd_helper_modified

Adversarial examples

Owner

Yandex School of Data Analysis

Monk is a low code Deep Learning tool and a unified wrapper for Computer Vision.

PyTorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision.

A PyTorch-Based Framework for Deep Learning in Computer Vision

TorchOk - The toolkit for fast Deep Learning experiments in Computer Vision

Deep Learning for Computer Vision final project

Computer vision - fun segmentation experience using classic and deep tools :)

QTool: A Low-bit Quantization Toolbox for Deep Neural Networks in Computer Vision

LeafSnap replicated using deep neural networks to test accuracy compared to traditional computer vision methods.

[CVPR 21] Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.

Repository for publicly available deep learning models developed in Rosetta community