A new Research and Markets report, "Global Hadoop Market 2012-2016," forecasts that the global Hadoop market might grow at a CAGR of 55.63 percent from 2012 to 2016 as the demand for big data analytics has increased.
However, the lack of trained professionals for the development of applications (apps) in Hadoop environment could pose a threat to the growth of this market.
Catering to the requirement of a simple development environment, Cloudera – a provider of open-source software related to big data – has announced the availability of industry’s first ever developer kit created for open-source Apache Hadoop distribution -- Cloudera's Distribution, including Apache Hadoop (CDH).
The new developer kit for CDH, Cloudera Developer Kit (CDK), incorporates a collection of application programming interfaces (APIs), tools, example code and documentation, which will help developers to build apps in Hadoop environments faster and easier than before.
Available as a free download on GitHub, the new CDK is open source and licensed under the same permissive Apache Software License. This feature will facilitate developers to use the code in any way they choose across existing commercial code bases or in any open source project.
In addition, CDK is modular in its approach. This feature will ultimately provide flexibility to developers, thereby enabling them to pick and choose the pieces they want to use, while freely substituting code of their own.
Elucidating about the new CDK, Eric Sammer, engineering manager, Cloudera, said in a statement, "At Cloudera we are not just Hadoop providers; we're also consumers who know first-hand the challenges developers can face when working with Hadoop."
Sammer added, "The new Cloudera Development Kit is one of the many ways we're sharing our deep expertise with the community. First-time Hadoop programmers will find that CDK walks them through each step of the process, enabling them to get up and running on the platform quickly, while more-experienced developers will appreciate the flexibility of CDK to swap out different components for a completely customized experience.”
One of the first modules to be included in the new CDK is the CDK Data module. This module is a set of APIs that simplifies working with datasets in Hadoop file systems, such as HDFS and the local file system.
In coming days, Cloudera will continue to add new modules to CDK, in order to extend its functionality and flexibility for developers.
Sammer further added, “By making Hadoop more accessible, we are excited to help an even broader range of organizations get more value out of their data."
Officials with Cloudera said that artifacts are also available from the Cloudera Maven Repository for Java developers using tools like Maven. This will facilitate for easy project integration.
In an effort to provide more information about the new CDK, Cloudera has added a new blog post and has also announced a new webinar titled, “Cloudera Development Kit (CDK): Hadoop Application Development Made Easier” to be hosted on May 21, 2013 by Sammer.
Edited by Rachel Ramsey