Wednesday, July 6, 2011

Some Hadoop Topics

  • Top 10 Big Data Applications running on Hadoop Cloud Computing
  • Image processing with hadoop
  • Understanding the Shuffle Sort
  • Predictive Analytics
  • Map Reduce algorithms, the state of the art
  • Map Reduce vs Parallel Databases
  • Fully Utilizating your Hadoop Cluster
  • Mahout
  • HBase schema design and optimization
  • Big memory computing for data intensive scientific applications
  • Reasoning - When Hadoop Meets the Semantic Web
  • Hadoop 2.0 - impact of emerging new hadoop distros. Is Cloudera still relivent?
  • Using databases as input to big data processing jobs
  • Innovation needed in Hadoop to drive greater adoption
  • EMC's Big Data Stack
  • Which NoSql DB to choose?
  • Social Entrepreneurs and Impact investors: Triple Botton Line Assessments
  • Data Integration
  • Real Time Analytics using Hadoop 
  • Revolutionary Big Data Insight Engine
  • Hack proofing methods. Going beyond encryption. 
  • HBase schema design
  • Analysis of social activity using both network and content
  • Testing Big Data Technologies
  • Marrying Big Data with Advanced Analytics  (not to be given by me!  I want to learn about this)
  • Converging analytics and search using Big Data technologies.
  • hadoop pipes w/cloudera
  • Security issues with Big Data.
  • Data Analytics in Hadoop Ecosystem
  • Hive integration with HBase.
  • High Performance Virtual Database System using Hadoop/Map Reduce: Extending
  • MapReduce to RDBMS
  • Using MAHOUT and NOSQL DB over hadoop or Amazon EMR
  • How can we use hadoop with confidential/encrypted data?
  • Moving file(s) and file system legacy constructs to key/value stores to serialize unstructured pattern data and perform analytics.
  • Data Integration with HADOOP.
  • Virtual Business Ecysystem & Virtual Expo data integration
  • Data Gravity and it's effect on Public Cloud Providers
  • Analyzing customer behavior
  • Data collection with Flume
  • Use cases around Hadoop and EDW integration
  • Toughest part of building an reliable hadoop cluster.

No comments:

Post a Comment