Skip to content

Final Project of Indonesia AI NLP Bootcamp: Fine Tune LLM for Ciptaker Law (a.k.a. Omnibus Law) with additional RAG testing for Law Question Answering. Deployed with Web using Node.js

Notifications You must be signed in to change notification settings

Willy030125/LLM_Ciptaker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LLM_Ciptaker (Final Project NLP Indonesia AI)

Beta Demo (Thanks to @shabir67)

ezgif-2-0c6c76e073

Data Source:

Train Fine-Tune LLM (with LoRA):

Retrieval Augmented Generation (RAG) testing:

  • RAG model: "llama3.1"
  • Embedding model: "nomic-embed-text"
  • RAG library: LangChain with Unstructured PDF Loader
  • Notebook: here

Frontend

  • Pre-requisites: Node version v20.17.0

Quick Run:

cd FE
npm install
npm start

Full Deployment:

Auto: Recreate the repo for FE files and host on Vercel.

Manual:

cd FE
npm install -g serve
serve -s build

Backend

To run the backend:

cd BE
python app.py

If hosted on a different PC, you may need a public IP or tunneling. Read more about tunneling here:

Notebook

Evaluation

RAG:

  • Cosine similarity: 0.40

Fine-Tuning LLM:

  • Perplexity: 1.0561115741729736
  • ROUGE: 0.7134693037488239
  • BLEU: 0.6164010763168335

About

Final Project of Indonesia AI NLP Bootcamp: Fine Tune LLM for Ciptaker Law (a.k.a. Omnibus Law) with additional RAG testing for Law Question Answering. Deployed with Web using Node.js

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages