TMCnet News
Anaconda Reveals Results of First 'State of Data Science' SurveyAnaconda, Inc., the most popular Python data science platform provider with 2.5 million downloads per month, today announced the results of its State of Data Science survey, revealing key trends in data science and machine learning within the Anaconda community. The survey, which ran from March 22 to April 30, 2018, resulted in 4,218 responses with a 100% survey completion rate. The majority of respondents were students (26%), followed by data scientists (16%), academics (15%) and software developers (15%). "The shift from managing big data to making data actionable is more important than ever in the enterprise," said Krishnan Subramanian, Chief Research Analyst, Rishidot Research. "Anaconda is easy to use and its users are experiencing clear value in their machine learning platform for cloud native especially as they transition to new technologies like containers." The State of Data Science The Anaconda State of Data Science is strong. With 2 to 2.5 million downloads per month during January to March 2018, Anaconda is easily the most popular Python distribution, with a growing R following. Key findings of the survey include:
Data Scientists Dropping Big Data and Looking at Containers and Cloud Traditional Hadoop-style "big data" performed relatively weakly versus the other options given this is a data-centric audience, and that Hadoop has dominated on-premises (non-cloud) data infrastructure for the past 10 years and spawned two tech IPOs (Hortonworks and Cloudera). From this, one could conclude that what was "big data" in 2005 when Hadoop began now easily fits into a single server's memory and there is a plethora of alternatives to building a Hadoop data lake. Additionally, containers are growing in production. Docker makes a strong showing at 19%, beating out Hadoop/Spark with 15%, followed by Kubernetes at 5.8%. These results suggest that modern cloud-native style architectures like Docker and Kubernetes are rising, again at the expense of traditional Hadoop "big data" and Apache Mesos (0.85%). Additional findings of interest include:
Supporting Resources About Anaconda, Inc. With over six million users, Anaconda is the world's most popular Python data science platform. Anaconda, Inc. continues to lead open source projects like Anaconda, NumPy and SciPy that form the foundation of modern data science. Anaconda's flagship product, Anaconda Enterprise, allows organizations to secure, govern, scale and extend Anaconda to deliver actionable insights that drive businesses and industries forward.
View source version on businesswire.com: https://www.businesswire.com/news/home/20180613005279/en/ |