Main

February 26, 2016

The Era Of Open Source Platforms Is Here

Posted by Ganapathy Subramanian (View Profile | View All Posts) at 5:06 AM

The Era of Open Source Platforms Is Here

Today, the world of open source is witnessing major innovations, and at a considerably high pace. Intense focus from academia, Open Source communities (Hive, Spark, Apache, etc.) as well as industry leaders are making this possible, to a large extent. In addition, with service providers and other IT players adequately providing support on such open source components, enterprises are increasingly shifting their workloads to an open data architecture. Particularly, enterprises are making this move as data is becoming exponential - in terms of volume, variety and velocity - and license costs for proprietary data platforms are rising.

They are beginning to experience unprecedented cost-performance benefits with open-source technologies compared to proprietary systems. For instance, Spark, which uses in-memory capabilities of the underlying Hadoop with its own map reduce, is 100 times faster than a traditional Hadoop-based system.

Continue reading "The Era Of Open Source Platforms Is Here " »

November 18, 2015

Making Data Talk ─ On Courts, Fields, and Grounds

Posted by Ganapathy Subramanian (View Profile | View All Posts) at 4:35 AM

Infosys ATP Beyond The Numbers: Back From The Brink

The eminent historian Doris Kearns Goodwin says that long before she began writing Pulitzer Prize-winning novels, she got a knack for recording events in the form of a scorecard whenever her father took her to a baseball game. Baseball, like its older cousin cricket, can be a complicated sport filled with a dizzying array of statistics. There are reams of data on each player, his batting average, runs scored in each game, and the number of home runs in the regular season, just to name a few. Those stats are only the tip of the iceberg.

Indeed, keeping meticulous scorecards and calculating player data caches is one reason Goodwin became as famous a historian as she was a devoted sports fan. She loved using data to analyze events. If only she had the power of analytics that sports fans now enjoy. The truth is, there's always been data surrounding our favorite sports. Think match data from every ATP tennis game going back many decades, or scorecards from very old cricket games that can be found on the Wisden website (or in the annual Wisden Cricketers' Almanack, before the Internet). But until recently, it was not possible to make all this data talk meaningfully. That is because, the available computing power wasn't sufficient to keep up with it all.

Continue reading "Making Data Talk ─ On Courts, Fields, and Grounds" »