From Our Blog

mapreduce real world example

MapReduce Real World Example in Python : Learn Data Science

MapReduce is a great approach to problem solving. It is very popular too, but MapReduce examples other than word-count are scarce on the web. This article describes MapReduce problem solving that is beyond word-count. … Continue Reading >MapReduce Real World Example in Python : Learn Data Science

arduino serial communication diagram

Arduino Serial Communication Basics : Learn Arduino

Arduino uses asynchronous serial communication to send-receive data to and from other devices. Arduino Uno supports serial communication via on-board UART port and Tx/Rx pins. Generally this transmission happens at 9600 bits per second which is termed as baud rate. … Continue Reading >Arduino Serial Communication Basics : Learn Arduino

logarithms

Why Logarithms are Beautiful?

Binary logarithm or log2 n is the power to which the number 2 must be raised to obtain value n. Binary logarithm (and others) has numerous applications in computer science. Let’s take analysis of algorithms for example. All algorithms have a running time, also called time complexity of algorithms. … Continue Reading >Why Logarithms are Beautiful?

horton works

Python MapReduce with Hadoop Streaming in Hortonworks Sandbox

Hortonworks sandbox for Hadoop Data Platform (HDP) is a quick and easy personal desktop environment to get started on learning, developing, testing and trying out new features. It saves the user from installation and configuration of Hadoop and other tools. This article explains how to run Python MapReduce word count example using Hadoop Streaming. … Continue Reading >Python MapReduce with Hadoop Streaming in Hortonworks Sandbox

Extracting Text from PDF Using Apache Tika

Extracting Text from PDF Using Apache Tika – Learn NLP

Most NLP applications need to look beyond text and HTML documents as information is contained in PDF, ePub or other formats. Apache Tika is a toolkit that extracts meta data and text from documents. There is a REST based Python library for Tika. … Continue Reading >Extracting Text from PDF Using Apache Tika – Learn NLP

fastext

Tutorial: Text Classification With Python Using fastText

We start by training the classifier with training data. It contains questions from cooking.stackexchange.com and their associated tags on the site. Let’s build a classifier that automatically recognize a topic of the question and assign a label to it. … Continue Reading >Tutorial: Text Classification With Python Using fastText

Extracting Text from PDF Using Apache Tika

Getting Started with fastText : Learn NLP

fastText is a text representation and classification library from Facebook Research developed by FAIR lab. Classification of text documents is an important natural language processing (NLP) task. It is originally written in C++ but can be accessed using Python interface. It is massively fast. See references for two defining papers. … Continue Reading >Getting Started with fastText : Learn NLP

harry-potter-deathly-hallows

Programming Computers to Read Stories

Can a computer read the stories, the way humans do? Of course computers can read from files much faster and accurately but second part of the question is more important. When we read a story we understand it we read the feelings of the protagonist, challenge here is to make computers do the same. … Continue Reading >Programming Computers to Read Stories

Most Inspiring Satya Nadella Quotes

Nadella talks about culture, empathy, philosophy and trust apart from technology in his Hit Refresh: The Quest to Rediscover Microsoft’s Soul and Imagine a Better Future for Everyone. Six most powerful thoughts from the book that inspired me and will surely help you stay grounded. … Continue Reading >Most Inspiring Satya Nadella Quotes