cppsea/InkWave

Abstract

InkWave aims to develop a machine learning model that converts handwritten notes into digitized text. The digitized text can then be formatted into various file types, including PDF and Markdown. Ultimately, we will build a UI for the model so that it can be seen in action and used easily when needed. We will use computer vision, natural language processing, and a large language model to develop it.

[Figure: InkWave basic working plan (draw.io diagram)]
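The flow described in the abstract (recognize handwriting, correct the text, format the result) could be wired together roughly as in the sketch below. This is only an illustration under stated assumptions: the `Pipeline` class, `fake_recognize`, `fake_correct`, and the `FIXES` table are hypothetical stand-ins, not part of the InkWave code base, and the real CV and NLP/LLM models would replace the two fake stages.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Pipeline:
    """Hypothetical wiring of the stages named in the abstract."""
    recognize: Callable[[bytes], List[str]]  # CV stage: image bytes -> raw text lines
    correct: Callable[[str], str]            # NLP/LLM stage: raw line -> corrected line

    def to_markdown(self, image: bytes, title: str) -> str:
        # Run recognition, correct each line, drop empty lines,
        # and emit one Markdown paragraph per remaining line.
        lines = [self.correct(line) for line in self.recognize(image)]
        body = "\n\n".join(line for line in lines if line.strip())
        return f"# {title}\n\n{body}\n"


# Stand-in stages (assumptions for illustration only).
def fake_recognize(image: bytes) -> List[str]:
    # A real CV model would read the image; this returns canned noisy text.
    return ["helo wrld", "", "secnd line"]


FIXES = {"helo": "hello", "wrld": "world", "secnd": "second"}


def fake_correct(line: str) -> str:
    # A real NLP/LLM model would correct spelling; this uses a lookup table.
    return " ".join(FIXES.get(word, word) for word in line.split())


pipeline = Pipeline(recognize=fake_recognize, correct=fake_correct)
markdown = pipeline.to_markdown(b"", title="Notes")
print(markdown)
```

Keeping each stage behind a plain callable means the CV model and the correction model can be trained and swapped independently, and a PDF formatter could be added next to `to_markdown` without touching either model.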

Dataset

We will use the following datasets, most of them from Kaggle, to train our models:
https://www.kaggle.com/datasets/landlord/handwriting-recognition/data (Learning CV Model Dataset)
https://www.kaggle.com/datasets/vipin20/nlp-word-correction (Learning NLP/LLM Model Dataset)
https://writeoff.cs.byu.edu/censustree/dataset.tar.gz (CV dataset)
https://www.kaggle.com/datasets/jpmiller/layoutlm/data (Tentative NLP/LLM Model dataset)
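Before training the CV model, the image labels need to be read into (image file, text) pairs. The sketch below shows one possible way to do that with the standard library. The column names `FILENAME` and `IDENTITY` are assumptions about the label CSV layout and should be checked against the downloaded files; `load_labels` and the sample rows are hypothetical and made up for illustration.

```python
import csv
import io
from typing import Iterable, List, Tuple


def load_labels(
    rows: Iterable[str],
    image_col: str = "FILENAME",   # assumed column name, verify against the CSV
    label_col: str = "IDENTITY",   # assumed column name, verify against the CSV
) -> List[Tuple[str, str]]:
    """Return (image file name, transcription) pairs, skipping unlabeled rows."""
    pairs = []
    for row in csv.DictReader(rows):
        name = (row.get(image_col) or "").strip()
        label = (row.get(label_col) or "").strip()
        if name and label:
            pairs.append((name, label))
    return pairs


# Made-up sample data standing in for a real label file.
sample = io.StringIO("FILENAME,IDENTITY\nsample_001.jpg,HELLO\nsample_002.jpg,\n")
pairs = load_labels(sample)
print(pairs)
```

With a real file, the same function can be called on an open file handle, for example `load_labels(open(path, newline=""))`, where `path` points at the label CSV.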

Tech Doc Link

https://docs.google.com/document/d/1btfWL-6h_YLID7R_LKtT4L6QeBYJlxQ6F1TWeHM_u-8/edit?usp=sharing
