Tuesday, July 26, 2011



Getting Realtime quotes from yahoo using command line



Check out the tags that you need to send via command line and then parse the output.



Wednesday, July 6, 2011

Lua is widely used as a scripting language by game programmers and is also supported by Apple iOS

Howto Manage an EC2 Instance using PuttySSH

Launch Instance Wizard and pick the AMI for a Small Instance Type (32-bit). Enter in a name for the key you would like to create, and save it in a secure place on your local machine. We will need it for SSH login purposes later. Next setup basic firewall access for SSH (fix the web server and database firewall access later).Note the Public DNS that was assigned and save the generated key on our local machine. Amazon key ends with the extension .pem. Before we use PuTTY, we have to convert that key to PuTTY's format (.ppk) using PuTTYgen. Login with username= ec2-user, no password will be required.

Some Hadoop Topics

  • Top 10 Big Data Applications running on Hadoop Cloud Computing
  • Image processing with hadoop
  • Understanding the Shuffle Sort
  • Predictive Analytics
  • Map Reduce algorithms, the state of the art
  • Map Reduce vs Parallel Databases
  • Fully Utilizating your Hadoop Cluster
  • Mahout
  • HBase schema design and optimization
  • Big memory computing for data intensive scientific applications
  • Reasoning - When Hadoop Meets the Semantic Web
  • Hadoop 2.0 - impact of emerging new hadoop distros. Is Cloudera still relivent?
  • Using databases as input to big data processing jobs
  • Innovation needed in Hadoop to drive greater adoption
  • EMC's Big Data Stack
  • Which NoSql DB to choose?
  • Social Entrepreneurs and Impact investors: Triple Botton Line Assessments
  • Data Integration
  • Real Time Analytics using Hadoop 
  • Revolutionary Big Data Insight Engine
  • Hack proofing methods. Going beyond encryption. 
  • HBase schema design
  • Analysis of social activity using both network and content
  • Testing Big Data Technologies
  • Marrying Big Data with Advanced Analytics  (not to be given by me!  I want to learn about this)
  • Converging analytics and search using Big Data technologies.
  • hadoop pipes w/cloudera
  • Security issues with Big Data.
  • Data Analytics in Hadoop Ecosystem
  • Hive integration with HBase.
  • High Performance Virtual Database System using Hadoop/Map Reduce: Extending
  • MapReduce to RDBMS
  • Using MAHOUT and NOSQL DB over hadoop or Amazon EMR
  • How can we use hadoop with confidential/encrypted data?
  • Moving file(s) and file system legacy constructs to key/value stores to serialize unstructured pattern data and perform analytics.
  • Data Integration with HADOOP.
  • Virtual Business Ecysystem & Virtual Expo data integration
  • Data Gravity and it's effect on Public Cloud Providers
  • Analyzing customer behavior
  • Data collection with Flume
  • Use cases around Hadoop and EDW integration
  • Toughest part of building an reliable hadoop cluster.

Thursday, March 10, 2011

Youtube to MP3

Two easy steps
  1. wget "http://www.youtube.com/watch?v=cQRytgGffV4" -qO- awk '/fmt_url_map/{gsub(/[\\"]/,"\n");print}' sed -n "/^fmt_url_map/,/videoplayback/p" sed -e :a -e '$q;N;2,$D;ba' tr -d '\n' sed -e "s/\(.*\),\(.\)\{1,3\}/\1/;s/\\\//g" wget -i - -O surprise.flv
  2. ffmpeg -i suprise.flv /mnt/hgfs/Downloads/suprise.mp3