A course on deep learning for computer vision and graphics, co-developed by YSDA and Skoltech.

Overview

Deep Vision and Graphics

This repo supplements the course "Deep Vision and Graphics" taught at YSDA in fall '21. The course is the successor of the "Deep Learning" course taught at YSDA in 2015-2021. The new course focuses more on applications of deep learning to computer vision.

Lecture and seminar materials for each week are in ./week* folders. Homeworks are in ./homework* folders.

General info

  • Telegram chat room (Russian).
  • YSDA deadlines & admin stuff can be found at the YSDA LMS (YSDA students only).
  • For technical issues, ideas, bugs in course materials, or contribution ideas, add an issue.

Syllabus

  • week01 Intro, recap of neural network basics, optimization, backprop, biological networks
  • week02 Images, linear filtering, convolutional networks, batchnorms, augmentations
  • week03 ConvNet architectures and how to find them, sparse convolutions in 3D, ConvNets for videos, transfer learning
  • week04 Dense prediction: semantic segmentation, superresolution/image synthesis, perceptual losses
  • week05 Non-convolutional architectures: transformers (some recap of their use in NLP), mixers, FFT convolutions
  • week06 Visualizing and understanding deep architectures, adversarial examples
  • week07 Object detection, instance/panoptic segmentation, 2D/3D human pose estimation
  • week08 Representation learning: face recognition, verification tasks, self-supervised learning, image captioning
  • week09 Latent models (GLO, AEs, flow models, diffusion models, VQ-VAE, generative transformers, CLIP, DALL-E)
  • week10 Generative adversarial networks
  • week11 Shape and motion estimation: spatial transformers, optical flow, stereo, monodepth, point cloud generation, implicit and semi-implicit shape representations
  • week12 New view synthesis: multi-plane images, neural radiance fields, mesh-based and point-based representations for NVS, neural renderers

Contributors & course staff

Course materials and teaching performed by


Comments
  • fix dd_helper_modified

    dd_helper_modified did not do what it was designed for. Even when it is called with layer=0, iterations=0, lr=0, our albatross still turns into a web site. The function received an np.array with values in the range [0, 1] and returned something with values in the range [0, 255]. The predict function is not robust to such a change of parameters.

    opened by zhukovaes 1
  • Adversarial examples

    predict(np.array(img_adv))

    The network breaks here because the input image has been scaled to the range [0, 255], while [0, 1] is expected. Once this is fixed, it is no longer so easy to break the network without changing the image significantly. im = Image.fromarray(np.uint8(input_im * 255))

    opened by jenyav94 0
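Both comments above describe the same range mismatch: an image on the [0, 255] scale reaches code that expects [0, 1]. A minimal sketch of one way to guard against it is shown below. The helper names `to_unit_range` and `to_uint8` are hypothetical and are not part of the course code, and the "any value above 1 means the [0, 255] scale" rule is a heuristic assumption that misclassifies very dark 8-bit images.

```python
import numpy as np


def to_unit_range(img):
    """Return a float32 copy of `img` scaled to [0, 1].

    Assumption (heuristic): if any value exceeds 1, the array is treated
    as being on the [0, 255] scale and is divided by 255.
    """
    arr = np.asarray(img, dtype=np.float32)
    if arr.max() > 1.0:
        arr = arr / 255.0
    # Clip to absorb small overshoots, e.g. after adding a perturbation.
    return np.clip(arr, 0.0, 1.0)


def to_uint8(img):
    """Convert an image to uint8 in [0, 255], e.g. before Image.fromarray."""
    return np.uint8(np.round(to_unit_range(img) * 255))
```

Under these assumptions, a call such as predict(to_unit_range(np.array(img_adv))) would always pass values in [0, 1], provided predict accepts a float array. Passing the scale explicitly instead of guessing it from the data would be the safer design if the calling code knows which range it holds.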
Owner
Yandex School of Data Analysis