Datakriya

Get More From Your Data

Introduction to Data Mining

Data mining is the process of turning raw data into useful information. Any numbers, text, facts, web pages or documents that can be processed by a computer are considered data and mining is the process of extracting something useful. Hence, as the name indicates, data mining is the process of extracting useful information from large volumes of data Read more

Introduction to Data Marts

A data mart is a repository of operational and aggregated data that supports the business enterprises in making critical decisions. The data mart consists of predefined subset of data that is organized for rapid analysis and reporting. The data mart focuses on meeting the different demands of a distinct group of users in terms of analysis and content Read more

Introduction to Data Warehousing

Data Warehouse is a storage place for data. It is used to store current and historical information.
According to Ralph Kimball, “Data warehouse is the conglomerate of all data marts within the enterprise. Information is always stored in the dimensional model” Read more

Introduction to Hive

Hive is SQL (Structured Query Language) type of programming language that runs on the platform of Hadoop. It was created to manage, pull, process large volume of data that Facebook produced.
We assume that you would already been familiar with the classical RDBMS (Relational Database Management System) and its underlying architecture along with the SQL structure and semantics. As Hive is part of Hadoop cluster so when a Hive query is submitted it gets converted into a sequence of Map Reduce jobs Read more

Introduction to PIG programming

Yahoo developed Pig and is one of the heaviest users of Hadoop, run 40 percent of all its Hadoop jobs with Pig.Twitter is also another well-known user of Pig.
Pig is a high-level Map Reduce based programming language built up over Hadoop platform. Read more

How Can We Help?

Our Focus

Data Management

  • Data Ingestion (Structured, Unstructured)
  • Data  Preprocess
  • Data Transformation
  • Data Cleanse
  • Data Profiling
  • Data Extraction in Desired Format

Data Governance

  • Data Policy
  • Data Standards
  • Data Ownership
  • Data Stewardship
  • Data Compliance
  • Data Security and Privacy

Data Analytics

  • Decide on the Objectives
  • Identify Business Metrics and Levers
  • Data Collection
  • Data Cleansing
  • Data Modeling
  • Optimize and Repeat

Our Expertise