Since this blog has been up I've fiddled with some text analysis stuff by analysing the text and making recommendations for similar blog entries. Did it all in PHP and MySQL just to understand how the algorithms work. Eventually it started to take about 5 hours to: tokenise and stemming the textcalculate...
Gaining insights with Natural Language Processing of Reddit Data to Evaluate Dermatology Patient Experiences and Therapeutics. "There is a lack of research studying patient-generated data on Reddit, one of the world’s most popular forums with active users interested in dermatology. Techniques within...
Source: sciencedirect.com
Understanding Latent Dirichlet Allocation with Gibbs Sampling by coding it from scratch. Latent Dirichlet Allocation (LDA) is a machine-learning technique that by the magic of many (many many) small calculations it can detect patterns in data and cluster documents, for example, into similar topics. ...
Source: github.io