Skip to content

masum035/Bengali-Grapheme-Optical-Character-Recognition

Repository files navigation

GitHub GitHub Repo stars GitHub repo size GitHub forks GitHub last commit

Bengali Grapheme Optical Character Recognition Tool

Grapheme: the smallest units in a written language

Bengali is the 5th most spoken language in the world with hundreds of million of speakers. It’s the official language of Bangladesh and the second most spoken language in India. Considering its reach, there’s significant business and educational interest in developing AI that can optically recognize images of the language handwritten.

Dataset Link

Masum's Kaggle Dataset Contribution

Flow of Code

  1. data_Reading.ipynb
  2. checking_dataframe.ipynb
  3. datasetClass.py
  4. pickle_image_creation.ipynb
  5. dataset.ipynb
  6. models.py
  7. model_dispatcher.py
  8. just_for_test.ipynb
  9. train.py
  10. run.sh

Grapheme : The MultiClass Classifier

Grapheme Image

Acknowledgements

Feedback

If you have any feedback, please reach out to me at abdullahmasum6035@gmail.com