Skip to content

KevinFasusi/NaiveBayesDocumentClassifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Naive Bayes Document Classifier (WPF VB.net)

The hosted project is a naive bayes document classifier. Currently using a multinominal model, however the Bernoulli and Ensemble model are also planned. The project was initially written using VBA in excel (I know the horror!), to assist in my PhD research. Whilst it functioned and was okay, even if I do say so myself, the single threaded nature of Excel was a bottle neck (link to VBA project for anyone interested). Plus all the bells and whistles of an OOP langauge were sorely missed, So I rewrote it in VB.net. I can hear the C# boys and girls scoffing and spitting out their coffee. You're wasting good java there people. Why do we always focus on the things that divide us? We are all brothers and sisters in code 😎 .

Current Development

  • Indicating model accuracy
  • Saving project and dependencies, for easier quick tests.
  • Removing connectives
  • User Guide

Future Developments

  • Direct integeration with twitter api, for sentiment analysis of tweets
  • PDF to text conversion
  • Export model as JSON
  • Optimise Model

User Guide

  1. Application Overview
  2. Models
    1. Multinominal
    2. Bernoulli
    3. Ensemble
  3. Project Walk Through
    1. Project Setup
      1. Selecting drive for project folder structure
      2. Setting folder path and folder alias
      3. Adding preclassified documents (Training Set)
      4. Adding validation documents (Test Set)
    2. Train Model
  4. Model Accuracy
  5. Navigating UI and Settings
  6. Examples and Use Cases

Contact

My contact details are below, if you wish to discuss anything, reach me at:

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published