indian-nlp / nanoGPT-assamese

Assamese poem generator using nanoGPT, trained on arambhani.txt from Kaggle.

Home Page: http://aakhor.feynmanpi.com/


NanoGPT Assamese

nanoGPT trained on an Assamese dataset
Public release coming soon
Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Contributing
  5. License
  6. Contact
  7. Acknowledgments

About The Project

Assamese or Asamiya is an Indo-Aryan language spoken mainly in the north-eastern Indian state of Assam, where it is an official language. It serves as a lingua franca of the wider region and has over 15 million native speakers according to Ethnologue.

This project trains nanoGPT on an Assamese-language dataset (arambhani.txt) so that the resulting model can generate Assamese poems.
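How the dataset is tokenized is not stated in this README. As a minimal sketch, assuming a character-level vocabulary like the one in nanoGPT's character-level Shakespeare example (an assumption, not confirmed for this project), Assamese text could be mapped to integer ids as shown here. The helper names `build_char_vocab`, `encode`, and `decode` are hypothetical and are not taken from this repository.

```python
# Illustrative character-level encoding for Assamese text.
# Assumption: the model uses a character-level vocabulary; this repo's
# actual preprocessing may differ.

def build_char_vocab(text):
    """Map every distinct Unicode code point in `text` to an integer id."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for ch, i in stoi.items()}
    return stoi, itos


def encode(text, stoi):
    """Turn a string into a list of integer ids."""
    return [stoi[ch] for ch in text]


def decode(ids, itos):
    """Turn a list of integer ids back into a string."""
    return "".join(itos[i] for i in ids)


# A short Assamese word stands in for the full dataset here.
sample = "কৃতাঞ্জলি"
stoi, itos = build_char_vocab(sample)
ids = encode(sample, stoi)
print(ids)
print(decode(ids, itos))
```

Note that Assamese vowel signs and the virama are separate Unicode code points, so a vocabulary built this way works on code points rather than on visually complete letters.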

Contributions are welcome! Please see the Contributing section to get started.

Built With

  • nanoGPT
  • dataset: arambhani.txt

Getting Started

To run this code locally, follow these steps.

Prerequisites

  • Git
  • Python 3 with pip

Installation

  1. Clone the repo
    git clone https://github.com/indian-nlp/nanoGPT-assamese.git
    
  2. Go to the project folder
    cd nanoGPT-assamese
    
  3. Create and activate a virtual environment (recommended)
    virtualenv venv_name
    source venv_name/bin/activate   # on Windows: venv_name\Scripts\activate
    
  4. Install the required Python packages using pip
    pip install -r requirements.txt
    
  5. Run sample-old.py to generate a sample from the trained model
    python sample-old.py --out_dir=out-assamese --device=cpu
    
    or, provide a starting text to generate from:
    python sample-old.py --out_dir=out-assamese --device=cpu --start=কৃতাঞ্জলি
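To try several starting texts without retyping the command, the call above could be wrapped in a small Python helper. This is only a sketch: `build_sample_command` and `run_sample` are hypothetical names, and the only flags used are the three shown above (`--out_dir`, `--device`, `--start`). It assumes the script is run from the project folder.

```python
import subprocess
import sys


def build_sample_command(out_dir="out-assamese", device="cpu", start=None,
                         script="sample-old.py"):
    """Build the argument list for the sampling script shown above."""
    cmd = [sys.executable, script, f"--out_dir={out_dir}", f"--device={device}"]
    if start is not None:
        # Optional starting text for generation.
        cmd.append(f"--start={start}")
    return cmd


def run_sample(**kwargs):
    """Run the sampling script; raises CalledProcessError if it fails."""
    return subprocess.run(build_sample_command(**kwargs), check=True)


# Build (but do not run) the command with an Assamese starting text.
command = build_sample_command(start="কৃতাঞ্জলি")
print(command[1:])
```

Calling `run_sample(start="কৃতাঞ্জলি")` from the project folder would then execute the same command as in step 5. Passing the arguments as a list avoids shell quoting problems with non-ASCII text.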
    

See the open issues for a list of proposed features and known issues.

Contributing

Contributions are welcome! Any contributions you make are greatly appreciated.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

Don't forget to give the project a star! Thank you!

License

MIT License.

Contact

Sagar Tamang - LinkedIn - cs22bcagn033@kazirangauniversity.in

Official Website: https://sagartamang.com

Project Link: https://github.com/indian-nlp/nanoGPT-assamese

Acknowledgments

  • I would like to thank ChatGPT for helping me debug all the errors.
