Big Data Tools

Grid  List 

Set Descending Direction
per page

  1. Apache Spark

    Apache Spark

    Apache Spark – An Open Source Big Data Tool

    The Apache Spark is an open source system for fast and flexible large-scale data analysis. These include interactive exploration of very large datasets, near real-time stream processing, and ad-hoc SQL analytics. It is an extremely fast cluster computing system that can run data in memory. The main advantage of Apache Spark is that it runs 100 times faster than Hadoop Map reduce

    Download Apache Spark Learn More...
  2. Apache Drill

    Apache Drill

    Apache Drill – An Open Source Big Data Tool

    Apache Drill is an open source software framework that supports data-intensive distributed applications for interactive analysis of large-scale datasets. The main feature of Drill is it is able to scale to 10,000 servers or more and to be able to process petabytes of data and trillions of records in seconds.

    Download Apache Drill Learn More...
  3. D3.js

    D3.js

    D3.js – An Open Source Big Data Tool

    D3.js is an open source JavaScript library which allows you to manipulate documents that display Big Data. D3 stands for Data Driven Documents. D3 has been designed to be extremely fast, it supports Big Data datasets, and it has cross-hardware platform capability. D3.js is used to create dynamic graphics using Web standards like HTML5, SVG and CSS.

    Download D3.js Learn More...
  4. HCatalog

    HCatalog

    HCatalog- An Open Source Big Data Tool

    HCatalog is an open source metadata and table management framework that works with Hadoop HDFS data. HCatalog is used to liberate Big Data by allowing different tools to share, that means that Hadoop users making use of a tool like Pig or MapReduce or Hive have immediate access to data created with another tool, without any loading or transfer steps.

    Download HCatalog Learn More...
  5. Apache Storm

    Apache Storm

    Apache Storm- An Open Source Big Data Tool

    Apache Storm is an open source distributed real-time computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing. Storm is simple, can be used with any programming language.

    Download Apache Storm Learn More...

Grid  List 

Set Descending Direction
per page



[profiler]
Memory usage: real: 22020096, emalloc: 21730496
Code ProfilerTimeCntEmallocRealMem