Skip to content

The "LLM Projects Archive" is a centralized GitHub repository, offering a diverse collection of Language Model Models projects. A valuable resource for researchers, developers, and enthusiasts, it showcases the latest advancements and applications in the realm of LLMs. Explore and contribute to the dynamic landscape of language model projects.

License

Notifications You must be signed in to change notification settings

Praveen76/LLM-Projects-Archive

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LLM Projects Archive

LLM NLP OpenAI's GPT Models Open Source LLMs Hugging Face PyTorch

Welcome to the LLM Projects Archive! This repository serves as a curated collection of projects related to Large Language Models (LLMs). If you are interested in exploring and contributing to projects that leverage LLMs for various applications, you're in the right place.

Table of Contents


Introduction

Large Language Models (LLMs), such as GPT-3, BERT, and others, have revolutionized natural language processing and understanding. This repository aims to bring together a diverse set of projects that utilize LLMs for different purposes. Whether it's text generation, sentiment analysis, summarization, or any other application, this archive aims to showcase the versatility and creativity of LLM-based projects.


LLM Projects

  1. Introduction to Huggingface, Spacy, and PyTorch:

    • Description: This repository provides an introduction to HuggingFace, PyTorch, and Spacy, focusing on building NLP projects. It includes links to several other repositories, each containing notebooks to familiarize oneself with these libraries.
    • Repository: Introduction to Huggingface, Spacy, and PyTorch
  2. Introduction to LLMs:

    • Description: A comprehensive GitHub project offering a beginner-friendly guide and hands-on examples to understand and work with Large Language Models (LLMs) in the field of NLP( Natural Language Processing).
    • Repository: Introduction to LLMs
  3. Customize Word Embeddings for LLMs:

    • Description: It's a GitHub project focused on tailoring word embeddings specifically for Large Language Models, enabling fine-tuned linguistic representations to enhance performance in domain-specific applications
    • Repository: Customize Word Embeddings for LLMs
  4. Create Word Embeddings using Word2Vec and GloVe:

  5. Finetune Open-source LLMs on Custom Data:

    • Description: It's a GitHub project focused on fine-tuning open-source LLMs on custom datasets, showcasing tailored solutions for enhanced language models.
    • Repository: Finetune Open-source LLMs on custom data
  6. Build & evaluate Retrieval-Augmented-Generation pipelines:

  7. Finetune Llama using QLoRA Method:

    • Description: "GitHub project for fine-tuning the Llama language model using the QLoRA (Quantized Low Rank Adaption) method for enhanced natural language understanding."
    • Repository: Finetune Llama using QLoRA Method
  8. Conversationl AI system using LLMs on E-commerce Data:

  9. Text Classification using Transformer Encoder Model:

  10. Language Translation using Transformer Decoder Model:

    • Description: This repository contains code for language translation using the Transformer Decoder Model. You'll learn about the Transformer architecture and apply it to a machine translation problem.
    • Repository: Language Translation using Transformer Model
  11. Text Classification using BERT Model:

    • Description: This repository contains code for building a text classification model using the BERT (Bidirectional Encoder Representations from Transformers) Model. The IMDB dataset has been utilized for this experiment.
    • Repository: Text Classification using BERT Model
  12. Build RAG-pipelines using Llama-Index:

  13. Finetune GPT2 Model on downstream tasks:

  14. PEFT for Text summarization:

    • Description: This repository demonstrates how to PEFT( Parameter-efficient-finetuning) for text summarization task.
    • Repository: PEFT for Text summarization
  15. Finetune T5-Model_for Text Summary:

  16. Finetune Llama2 and Mistral7B using Langchain:


Pre-LLM era Projects:

  1. Text Classification using Naive Bayes Classifier:

  2. Build a Custom NER Model using Spacy:

    • Description: This repository contains code for building a custom Named Entity Recognition (NER) model using the spaCy library. The Medical NER dataset has been utilized for this experiment.
    • Repository: Build a Custom NER Model using Spacy
  3. Sentiment Analysis using LSTM Model:

    • Description: This repository contains a Python notebook for Sentiment Analysis using the LSTM (Long Short-Term Memory) model. The IMDB dataset has been utilized for this experiment.
    • Repository: Sentiment Analysis using LSTM Model
  4. Build a Forecasting Model using RNN:

    • Description: This repository contains code for building a forecasting model using Recurrent Neural Networks (RNNs). A climate-related dataset has been utilized for this experiment.
    • Repository: Build a Forecasting Model using RNN

How to Contribute

We encourage you to contribute to this archive by adding your own LLM-related projects or discovering new ones. Follow these steps to contribute:

  1. Fork the repository.
  2. Add your project information to the projects.md file. Include the project name, a brief description, and the GitHub repository link.
  3. If your project falls into a specific category (e.g., sentiment analysis, chatbots, translation), please categorize it accordingly.
  4. Submit a pull request.

Please adhere to the contributing guidelines for a smooth collaboration.

License

This repository and its contents are open-sourced under the MIT License. Feel free to use, modify, and distribute these projects in accordance with the terms specified in the license.

Issues:

If you encounter any issues or have suggestions for improvement, please open an issue in the Issues section of this repository.

Contact:

The code has been tested on Windows system. It should work well on other distributions but has not yet been tested. In case of any issue with installation or otherwise, please contact me on Linkedin

Happy coding!!

About Me:

I’m a seasoned Data Scientist and founder of TowardsMachineLearning.Org. I've worked on various Machine Learning, NLP, and cutting-edge deep learning frameworks to solve numerous business problems.

About

The "LLM Projects Archive" is a centralized GitHub repository, offering a diverse collection of Language Model Models projects. A valuable resource for researchers, developers, and enthusiasts, it showcases the latest advancements and applications in the realm of LLMs. Explore and contribute to the dynamic landscape of language model projects.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages