vision-transformer

Here are 757 public repositories matching this topic...

computer-vision-challenge

Seq2SeqSharp is a fast, flexible, tensor-based deep neural network framework written in .NET (C#). Its features include automatic differentiation, several network types (Transformer, LSTM, BiLSTM, and others), multi-GPU support, cross-platform support (Windows, Linux, x86, x64, ARM), and multimodal models for text and images.

  • Updated Jun 6, 2024
  • C#

A comprehensive list of our research works related to remote sensing [SAMRS@NeurIPS'23, RVSA@TGRS'22, RSP@TGRS'22], including papers, code, and citations. Note: The repo for [TGRS'22] "An Empirical Study of Remote Sensing Pretraining" has been moved to: https://github.com/ViTAE-Transformer/RSP

  • Updated Jun 6, 2024
  • TeX

Code for the video deepfake detector from "MINTIME: Multi-Identity Size-Invariant Video Deepfake Detection", a paper available in IEEE Transactions on Information Forensics and Security.

  • Updated Jun 5, 2024
  • Jupyter Notebook
