Posts

Showing posts with the label Data Science

Top reasons to join EMC Academic Alliance

Image
Did you know more than 200 colleges in India are part of EMC Academic Alliance   Here are my top reasons to join :  Prepares students on Storage, Cloud Infra, Data Science, Backup and Cloud Security. Offered at Zero Cost Opportunity for students to get certified on these technologies for less than $25 Only technology program from an industry leader, as we don't believe in vendor lock-ins No difference in offerings for academia and  corporate Most social educational program with more than 750K on facebook alone Developer opportunities for students  Fantastic support from Local Center of Excellence for Faculty and student events including visits  Help certified candidates find the right career ( coming soon ) Refer your College or University to EMC Academic Alliance via   http://www.surveymonkey.com/s/BPWQQTF

Latest SBIC Report Reveals Accelerated Enterprise Adoption of Big Data, Mobile, Social Media and Cloud Computing Introduce Significant Gaps in Information Security Programs

RSA Security, released a special report from the Security for Business Innovation Council (SBIC) that assesses how disruptive innovations such as Big Data analytics , cloud computing, enterprise mobility and social media will transform enterprise IT and hammer away at the foundations of information security strategies in 2013.  The Council’s latest report details four strategies to help enterprises adapt information security programs to help enable business innovation over the next 12 months. These strategies include how to boost risk and business skills, court middle management, tackle IT supply chain issues and build tech-savvy action plans. The Council’s guidance will help enterprises face the impact of the technology adoption of cloud computing , social media , mobile and Big Data . The Council also outlines the major impacts of these trends for security teams and how to address them.  Cloud Computing – The accelerated adoption of cloud will push security conc...

Data Science Series Booklets

Image
EMC  Greenplum sponsored a series of  thought-provoking  booklets, edited by  innovation technology guru Peter Hinssen  and published by Across Technology.  To keep you  informed on a constant basis, they have created  the Data Science Series website ( www.datascienceseries.com ), offering you case stories  from your peers, valuable insight into market research and an overview of the Catalyst partners  that help EMC Greenplum bring the right building blocks to the market. Allowing you to build  the right ‘refinery’ for all the information that is coming your way. Make sure you don’t miss the installments of the series. Click below to read them one by one -  Information is the new oil -  Markets are fast disappearing, and being replaced by networks. Networks of intelligence. Consumers can’t be controlled anymore, and become active, and are turning markets into dynamic systems. The age-old subject of mar...

EMC Greenplum Chorus is open source - OpenChorus Project

Image
In the legacy analytics process, data scientists face challenges in accessing and sharing the right data. GreenplumChorus helps foster a complete data science ecosystem with best-of-breed analytics applications. As a social platform for collaborative data science, Greenplum Chorus users can increase productivity, decrease administrative burdens on IT infrastructures, and get better visibility and faster access to data through a single tool. EMC recently released the Greenplum Chorus source code under an Apache open source license through the  OpenChorus  Project. The OpenChorus Project will speed innovation and adoption of collaborative data science practices, helping organizations to drive greater business insight and economic value from Big Data. Making OpenChorus accessible to Data Scientists Greenplum and Kaggle joined forces to tackle the short supply and heavy demand for data scientists with an integration between the Kaggle data science c...

Data Scientist as a career option #datasci

Image

Visualizations from The Human Face of Big Data

Image
Data visualization  is the study of the visual representation of data, meaning "information that has been abstracted in some schematic form, including attributes or variables for the units of information". The Human Face of Big Data   is a global, crowdsourced media project focusing on humanity's new ability to collect, analyze and visualize vast amounts of data in real time.   Data visualizations help paint a picture of how Big Data affects and measures our lives. These revealing new interactive data visualizations were created for The Human Face of Big Data project Mission Control. A team of designers and data scientists from EMC Cloud Services, EMC Greenplum, and Tableau Software analyzed one billion unique global tweets from Twitter, along with other data, to amplify themes and stories from the project. A data set of approximately 170 billion unique data elements was drawn from the billion tweets and loaded into an EMC Greenplum Data Computing A...

EMC at Oracle OpenWorld 2012 - Cloud, Big Data and Trust

Image
Sharing some screen shots from the Keynote session ( by Joe Tucci, EMC Chairman, President and CEO, and Jeremy Burton, EVP, Product Operations and Marketing ) available on   http://bit.ly/PyFL4A  and some key take aways. Just amazed to see the way EMC has transformed and enabling others in this transformation Digital Universe is Growing Faster than Expected  Cloud Computing Transforms IT It's all FLASH  Storage - Network and Sever - EMC Working on building products to address data at all levels ( Project X, Project Thunder and VFCache ) Data Scientist and Data Science are HOT Software Defined Data Centers are coming Putting a Face of Big Data - #HFOBD - The Biggest Crowd-sourced Project   Also, some good works going on Big Data Analytics with Greenplum Chorus and UAP. Not to forget the Massive Parallel Processing capabilities and Scale-out Stor...

What is Big Data ?

Image
Big data  (also spelled Big Data) is a general term used to describe the voluminous amount of unstructured and semi-structured data a company creates -- data that would take too much time and cost too much money to load into a relational database for analysis. Although Big data doesn't refer to any specific quantity, the term is often used when speaking about petabytes and exabytes of data. To clarify matters, the three Vs of volume, velocity and variety are commonly used to characterize different aspects of big data. Big data is being enabled by inexpensive storage, a proliferation of sensor and data capture technology, increasing connections to information via the cloud and virtualized storage infrastructures, and innovative software and analysis tools. Big data is not a "thing" but instead a dynamic/activity that crosses many IT borders. IDC defines it this way:  Big data technologies describe a new generation of...

What is "The Human Face of Big Data" Project

Image
“ The Human Face of Big Data ,” the latest groundbreaking, globally crowdsourced initiative from Rick Smolan, the creator of the “Day in the Life” series. The project, made possible through primary sponsorship from EMC, is based on the premise that the real-time visualization of data collected by satellites, and by billions of sensors, RFID tags, and GPS-enabled cameras and smartphones around the world, is enabling humanity to sense, measure, understand and affect aspects of our existence in ways our ancestors could never have imagined in their wildest dreams.  The multifaceted project kicks off on September 25 with an eight-day “Measure Our World” event inviting people around the world to share and compare their lives in real time through an innovative smartphone app. The project also includes “Mission Control” events in New York, Singapore and London; “Data Detectives,” a global student initiative being conducted in conjunction with the TED organization; a stunning large-...

What makes a Data Scientist ?

Image
Some of the skill sets sort after, when looking for a Data Scientist are ( Source : Careers @ EMC )   Strong statistical foundation, with broad knowledge of deterministic and probabilistic statistical methods. Proficiency in at least one of the following statistical toolkits: SAS, R, SPSS, Matlab, Mahout/MADLib. Programming strength in at least one of the following languages: SQL, C/C++, Java, Python, Perl. Optional programming strength in the following Hadoop tools: MapReduce, Pig, Hive, Hbase. Natural ability to communicate basic quantitative concepts clearly. Natural curiosity to research and identify possible quantitative solutions to common business problems. Technical knowledge of distributed computing platforms, and common data process flows from data instrumentation & generation, to ETL, to the data warehouse itself. Advanced degree (PhD or Masters) in an analytical or technical field (e.g. applied mathematics, statistics, physics, computer science, operation...