Hacking DailySocial's News

DailySocial.net is a tech blog founded by Rama Mamuaya. I enjoy visiting DailySocial and reading about the Indonesian tech scene, but I’ve grown weary of filtering which news to read. So why not hack together a news classifier, I thought. Core Computing: It took 10 minutes to hack something up in Python. Why so fast, you ask? Because text processing is second nature in Python. NLTK is good but TextBlob is great. I used the Naive Bayes classifier provided by TextBlob. All I coded was a feature extractor tailored to my own tastes. ...

March 20, 2014 · Batista Harahap
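
As a rough illustration of the approach in the excerpt above (a Naive Bayes classifier driven by a custom feature extractor), here is a minimal pure-Python sketch. It deliberately does not use TextBlob's API, and it is not the post's actual code: the class `TinyNaiveBayes`, the function `extract_features`, the labels, and the sample headlines are all hypothetical and made up for illustration.

```python
import math
from collections import Counter, defaultdict


def extract_features(text):
    """Hypothetical feature extractor: the set of lowercase words longer than two characters."""
    words = (w.strip(".,!?:;\"'()").lower() for w in text.split())
    return {w for w in words if len(w) > 2}


class TinyNaiveBayes:
    """Simplified Naive Bayes over word-presence features (illustrative only)."""

    def __init__(self, train):
        self.label_counts = Counter()               # number of documents per label
        self.feature_counts = defaultdict(Counter)  # label -> feature -> document count
        self.vocab = set()
        for text, label in train:
            self.label_counts[label] += 1
            for feature in extract_features(text):
                self.feature_counts[label][feature] += 1
                self.vocab.add(feature)

    def classify(self, text):
        # Only features seen during training contribute to the score.
        features = extract_features(text) & self.vocab
        total = sum(self.label_counts.values())
        scores = {}
        for label, count in self.label_counts.items():
            score = math.log(count / total)  # log prior
            for feature in features:
                seen = self.feature_counts[label][feature]
                # Add-one (Laplace) smoothing so unseen features never give log(0).
                score += math.log((seen + 1) / (count + 2))
            scores[label] = score
        return max(scores, key=scores.get)


# Made-up training headlines, labeled by whether the reader would want them.
TRAIN = [
    ("Indonesian startup raises seed funding", "read"),
    ("New Python library for text processing released", "read"),
    ("Startup funding round led by local investors", "read"),
    ("Celebrity gadget giveaway contest announced", "skip"),
    ("Smartphone giveaway contest winners announced", "skip"),
    ("Weekly gadget promo roundup", "skip"),
]

classifier = TinyNaiveBayes(TRAIN)
print(classifier.classify("Local startup secures funding"))
print(classifier.classify("Gadget giveaway contest"))
```

The only part that encodes personal taste is `extract_features`; swapping in a different extractor changes what the classifier pays attention to without touching the rest.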

WordGrapher - Build A Graph from Words and Documents

Just recently (last night), Steven Loria updated TextBlob to v0.5.0. The module offers a relatively easy way to do Natural Language Processing in Python. NLTK is a dependency, so it’s familiar turf with a gentler getting-started experience. Building on this, I wrote an easy way to parse a set of words and documents and measure important keywords using the TF-IDF algorithm. A few minutes ago I uploaded the module to PyPI and tagged it as v0.1.0. It’s still rough, and all it does for now is plain TF-IDF. The next version will incorporate the graph-building feature. ...

August 11, 2013 · Batista Harahap
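
For readers unfamiliar with TF-IDF, the scoring mentioned in the excerpt above can be sketched in a few lines of plain Python. This is not WordGrapher's code or API; the function names (`tokenize`, `tf`, `idf`, `top_keywords`) and the three sample documents are assumptions made up for illustration, and the formula used is one common variant (term frequency times the log of inverse document frequency).

```python
import math


def tokenize(text):
    """Split on whitespace, strip surrounding punctuation, lowercase."""
    words = (w.strip(".,!?:;\"'()").lower() for w in text.split())
    return [w for w in words if w]


def tf(word, tokens):
    """Term frequency: share of the document's tokens that are `word`."""
    return tokens.count(word) / len(tokens)


def idf(word, corpus):
    """Inverse document frequency; assumes `word` occurs in at least one document."""
    containing = sum(1 for tokens in corpus if word in tokens)
    return math.log(len(corpus) / containing)


def top_keywords(doc_index, corpus, n=3):
    """Return the n highest-scoring words of one document, ties broken alphabetically."""
    tokens = corpus[doc_index]
    scores = {word: tf(word, tokens) * idf(word, corpus) for word in set(tokens)}
    ranked = sorted(scores, key=lambda word: (-scores[word], word))
    return ranked[:n]


# Made-up sample documents.
DOCS = [
    "python makes text processing easy",
    "text processing with graphs of words",
    "graphs connect words and documents",
]
CORPUS = [tokenize(doc) for doc in DOCS]

for i in range(len(CORPUS)):
    print(i, top_keywords(i, CORPUS))
```

Words that appear in many documents get a low IDF and drop down the ranking, which is why terms shared across the sample documents score below the ones unique to a single document.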

Machine Favorited Tweets - Organically Improve Followers Count

Last night, I read James Moriss’ blog post on how to gain more followers by favoriting other tweets relevant to your own tweets or to other keywords you are interested in. The downside was that you still have to input the keywords yourself. So I hacked up some code to let it figure out which keywords are appropriate. This was my first foray into Python’s NLTK. WARNING: The code below is not production ready. It is only a proof of concept and therefore should not be used in production environments without proper knowledge. ...

August 3, 2013 · Batista Harahap