Big Data is a big word. Lets break it up to understand it deeply.
So Big is huge, immense, enormous, vast, gigantic, giant, massive, prodigious and what not. But Big is also significant, important, major, vital, critical, crucial, grave and serious.
And Data, as we all know is a collection of facts and statistics, brought together for reference or analysis.
Now these facts and stats are not format-bound. Their collection is done in various forms. These forms may vary from simple texts to complex codes, images, video files, language driven stats, oral facets and a combination of all the above. With so much variegation, it is difficult to gather and process the information. So we have a bigger picture – Big Data.
Definition Big Data is a term that describes the large volume of data – both structured and unstructured – that inundates a business on a day-to-day basis. It is an evolving term that describes any voluminous amount of structured, semi-structured and unstructured data that has the potential to be minded for information. Although Big Data doesn’t refer to any specific quantity, the term is often used when speaking about petabytes and exabytes of data.
The IT industry uses following emerging technologies to handle Big Data in a cost-effective manner :
schema-less databases (NoSQL databases)
Indian startups are at par with rest of the world when it comes to technology. Following are some of the revered Indian startups working with Big Data.
Heckyl is a Mumbai based startup, using Big Data Technology. According to their LinkedIn profile, Heckyl provides real-time news analytics, live market analysis of India and US equity markets. It was founded by Som Sagar, Mukund Mudras, Jaison Mathews and Abhijit Vedak in December 2010.
With its hard work and dedication Heckyl aims to be a global leader in the space of information analytics for worldwide financial markets.
Mu sigma is a Bangalore based startup, founded in 2004 by Dhiraj C Rajaram. This firm mainly offers analytics services. The company is ISO 27001 certified. Mu Sigma solves high-impact business problems by accelerating the journey from Data Engineering to Data Sciences and Decision Sciences; thereby institutionalizing Decision Support. Mu Sigma uses a variety of commercial, open-source and proprietary tools and technologies to address business problems. The Big Data technologies used by this startup are – Hadoop, PIG, Hive, HBase and spark.
Spire Technologies is a Bangalore based startup, founded in 2008 by Saurabh Jain. According to their website, Spire is a context intelligence company. It means it generates contextual meaning out of ANY content whether it is text, audio or video. This startup has built world’s first context intelligence platform – PaaS. Spire context intelligence platform understands data making it far more meaningful and useful with far-reaching efficiency and cost savings for organizations. PaaS can be applied to finance, SCM, legal, education, defense, risks and many more domains. Spire Technologies evaluate your Big Data making decision-making more easy and meaningful to your business.
PromptCloud is a Bangalore based firm, founded by Prashant Kumar in 2009. This startup deals with Big Data solutions. Their main focus is on Web Crawling and data extraction. They have their hold in domains like travel, finance, health-care, marketing, analytics and more. PromptCloud operates on Data as a Service (DaaS) model. The DaaS platform has been designed taking into account various use cases and the non-uniform formats on the web. It is built on open-source technologies – Hive, Nutch, Lucene, Cassandra, Chef, talend and Hadoop.
Metaome is again a Bangalore based startup, co-founded by Kalpana Krishnaswami and Ramkumar Nandakumar in 2013. It is a Big Data company that delivers knowledge-mining solutions. Their main focus is on health care services and life sciences industries. DistilBio is the product offered by this startup for enterprise level data integration, search and discovery. DistilBio Enterprise enables the user to seamlessly search across the data and identify hidden relationships within the data. DistilBio brings all your data together and acts as a knowledge base complimentary to your data sources. Your data can be in spread sheets, plain text or in databases. DistilBio brings them together seamlessly.