Skip to content
@SCUT-DLVCLab

SCUT-DLVCLab

华南理工大学深度学习与视觉计算实验室

About Us 🚀

The Deep Learning and Vision Computing Lab is dedicated to advanced theoretical research and innovative applications in the fields of artificial intelligence, computer vision, machine learning, and pattern recognition. Our current research focuses on deep learning, text detection and recognition, document analysis and understanding, and artificial intelligence. In recent years, our team has led more than 30 national and provincial research projects, making significant achievements in optical character recognition (OCR), handwriting recognition, gesture recognition and interaction technology, and innovative applications of deep learning. We have published over 300 SCI/EI papers, obtained more than 50 authorized invention patents, won 5 provincial and ministerial science and technology awards, and achieved first place in international academic competitions 4 times.

Pinned

  1. GPT-4V_OCR GPT-4V_OCR Public

    Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

    Python 106 3

  2. Document-AI-Recommendations Document-AI-Recommendations Public

    Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

    127 1

  3. SCUT-EnsExam SCUT-EnsExam Public

    SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper images. The dataset is randomly divided into training set and …

    7

  4. RFUND RFUND Public

    Official release of RFUND introduced in the paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction" (arXiv:2401.03472).

    13

Repositories

Showing 9 of 9 repositories
  • WenMind Public

    WenMind benchmark.

    0 0 0 0 Updated Jun 6, 2024
  • MegaHan97K Public

    MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories

    0 0 0 0 Updated Jun 4, 2024
  • .github Public
    0 0 0 0 Updated Jun 4, 2024
  • C3bench Public

    C3 benchmark

    0 0 0 0 Updated May 27, 2024
  • Document-AI-Recommendations Public

    Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

    127 1 0 0 Updated May 13, 2024
  • RFUND Public

    Official release of RFUND introduced in the paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction" (arXiv:2401.03472).

    13 0 0 0 Updated Mar 22, 2024
  • SCUT-EnsExam Public

    SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper images. The dataset is randomly divided into training set and test set of 430 and 115 images, respectively.

    7 0 0 0 Updated Dec 5, 2023
  • GPT-4V_OCR Public

    Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

    Python 106 3 2 0 Updated Nov 13, 2023
  • Mnist-99.7-Accuracy-with-Pytorch Public

    A CNN model builds with Pytorch and reaches 99.7% accuracy

    Python 4 1 0 0 Updated May 1, 2021

Top languages

Python

Most used topics

Loading…