Some of the skill sets sort after, when looking for a Data Scientist are ( Source : Careers @ EMC )
Additional Links :
- Strong statistical foundation, with broad knowledge of deterministic and probabilistic statistical methods.
- Proficiency in at least one of the following statistical toolkits: SAS, R, SPSS, Matlab, Mahout/MADLib.
- Programming strength in at least one of the following languages: SQL, C/C++, Java, Python, Perl.
- Optional programming strength in the following Hadoop tools: MapReduce, Pig, Hive, Hbase.
- Natural ability to communicate basic quantitative concepts clearly.
- Natural curiosity to research and identify possible quantitative solutions to common business problems.
- Technical knowledge of distributed computing platforms, and common data process flows from data instrumentation & generation, to ETL, to the data warehouse itself.
- Advanced degree (PhD or Masters) in an analytical or technical field (e.g. applied mathematics, statistics, physics, computer science, operations research
- A strong business-orientation, able to select the appropriate complex quantitative methodologies in response to specific business goals
Additional Links :
- Video : Data Science Team at Greenplum
- EMC Data Science Study 2011
- Data Scientist Certification with EMC Education Services
- Data Science Summit