Posts

Showing posts with the label Big Data Analytics

Tech Glossary : Defining Data Lake

Image
A data lake is a storage repository that holds a vast amount of raw data in its native format. While a hierarchical data warehouse stores data in files or folders, a data lake uses a flat architecture to store data. Each data element in a lake is assigned a unique identifier and tagged with a set of extended metadata tags. When a business question arises, the data lake can be queried for relevant data, and that smaller set of data can then be analyzed to help answer the question. The term data lake is often associated with Hadoop-oriented object storage. In such a scenario, an organization's data is first loaded into the Hadoop platform, and then business analytics and data mining tools are applied to the data where it resides on Hadoop's cluster nodes of commodity computers. Like big data, the term data lake is sometimes disparaged as being simply a marketing label for a product that supports Hadoop. Increasingly, however, the term is being accepted as a way to des...

Big Data Case Study: Transform Marketing To Take More Share

Big Data can change the way you market your products and services enabling you to achieve higher levels of personalization, customer satisfaction and experience. Join this live webcast for an interactive discussion about how EMC built a data-driven marketing science practice that is transforming how EMC does Marketing.  Big Data Case Study: Transform Marketing And Take More from EMC Academic Alliance

Big data could help US citizens save up to $450 billion in healthcare #McKAnalytics

Image
A new report from McKinsey estimates that big data could save the health care industry up to $450 billion, but it has to overcome a few obstacles first. Read more below :  The big data_revolution_in_healthcare from Rajesh Nambiar

Slidecast on Big Data Analytics

At International Conference on Cloud & Big Data Analytics (ICCBDA 2013) - David Dietrich Advisory Technical Education Consultant, Big Data & Data Science, EMC Corporation - delivered the pre-conference talk on Big Data Analytics. Here is the Slidecast for you -  Big Data Analytics from EMC Academic Alliance

Complete List of Big Ideas Videos from EMC Corp

Image
Big Idea videos from EMC are a great example of how to  Learning can be FUN . These  hand-drawn animations are created and narrated by EMC's Patricia Florissi, VP and Global Sales CTO. An Exhaustive list of these popular videos are below :   How Big is Big Data?  Big Ideas: Simplifying Cluster Architectures  Big Ideas: Demystifying Hadoop  Big Ideas: Simplifying High Performance Computing Architectures Big Ideas: Demystifying Fast Data  Big Ideas: Why Big Data Matters  Big Ideas: Simplifying Big Data Loading Big Ideas: Demystifying the Convergence of Big Data Architectures Big Ideas: When Data Needs Continuous Availability Big Ideas; Big Tech: Continuous Operations for VMware with EMC VPLEX Big Ideas; Big Tech: Continuous Operations for Oracle RAC with EMC VPLEX FLASH In A Flash Big Tech...

Enterprise Strategy Group: The Big Data Security Analytics Era is Here

This analyst report explains that organizations can no longer rely on preventive security systems, point security tools, manual processes, and hardened configurations to protect against targeted attacks. Henceforth, security management must be based on continuous monitoring and big data analysis for situational awareness and rapid decisions.

EMC Talks Flash Storage, Pivotal Initiatives in Q4 Financial Call

Image
We heard about Full Flash Storage and Pivotal initiative last year. In the Q4 2012 Financial Call, EMC gave some insight into its 2013 vision.  Technology drivers for 2013 EMC, VMware and Pivotal Initiatives key focus areas for 2013 More specifics will be available in the coming month. 

Transformation Story Map with Cloud and Big Data

Image
EMC Global Services has created an excellent  graphic visualization  on the journey to Cloud Computing and  Big Data. Check them below - 

Announcing : EMC Academic Forum 2013

Image
As a follow up from my earlier post on  International Conference on Cloud and Big Data Analytics #ICCBDA 2013 , we are pleased to invite you to Day 1 event - EMC Academic Forum 2013 -

EMC IT's Journey to Big Data with Business Analytics as a Service

Cloud represents the ideal infrastructure for dealing with vast quantities of data–or Big Data. Businesses are motivated to leverage Big Data because they know that locked inside of it are big opportunities. As a continuation to my post on  EMC IT's Journey to Cloud ,  i'm pleased to share a new w hitepaper on Business Analytics as a Service . This white paper examines how EMC is exploiting the Big Data opportunity with a new agile model for analytics and reporting. Business-Analytics-as-a-Service (BAaaS) significantly reduces total cost of ownership and provides predictive analytics proficiency and increased business agility. The paper details BAaaS architecture, deployment, results, best practices, and early adopter use cases.

Latest SBIC Report Reveals Accelerated Enterprise Adoption of Big Data, Mobile, Social Media and Cloud Computing Introduce Significant Gaps in Information Security Programs

RSA Security, released a special report from the Security for Business Innovation Council (SBIC) that assesses how disruptive innovations such as Big Data analytics , cloud computing, enterprise mobility and social media will transform enterprise IT and hammer away at the foundations of information security strategies in 2013.  The Council’s latest report details four strategies to help enterprises adapt information security programs to help enable business innovation over the next 12 months. These strategies include how to boost risk and business skills, court middle management, tackle IT supply chain issues and build tech-savvy action plans. The Council’s guidance will help enterprises face the impact of the technology adoption of cloud computing , social media , mobile and Big Data . The Council also outlines the major impacts of these trends for security teams and how to address them.  Cloud Computing – The accelerated adoption of cloud will push security conc...

Real Big Data Defined

Image
We live in a world of Big Data . This year alone, over a trillion gigabytes of new data will be created globally. Big Data presents a big challenge – but also exciting new opportunities for enterprises to rise above the competition.  Keeping this in mind, the teams at  EMC   and Greenplum launched  Real Big Data (  www.therealbigdata.com ) -  as a place for the latest news and discussions in the world of  Big Data . In the last few months, the team has built a series of guides with some very good information on Big Data .  You may download the same from the link below -   Big Data: A CIO’s Cut Out and Keep Guide -  Every now and then something comes along that has the potential to change the face of business as we know it. Currently big data is being touted as that thing, and CIOs everywhere need to get a grip on what it is and how it can benefit their company.  Big Data: Riding the Wave | A Guide for IT ...

Data Science Series Booklets

Image
EMC  Greenplum sponsored a series of  thought-provoking  booklets, edited by  innovation technology guru Peter Hinssen  and published by Across Technology.  To keep you  informed on a constant basis, they have created  the Data Science Series website ( www.datascienceseries.com ), offering you case stories  from your peers, valuable insight into market research and an overview of the Catalyst partners  that help EMC Greenplum bring the right building blocks to the market. Allowing you to build  the right ‘refinery’ for all the information that is coming your way. Make sure you don’t miss the installments of the series. Click below to read them one by one -  Information is the new oil -  Markets are fast disappearing, and being replaced by networks. Networks of intelligence. Consumers can’t be controlled anymore, and become active, and are turning markets into dynamic systems. The age-old subject of mar...

EMC Greenplum Chorus is open source - OpenChorus Project

Image
In the legacy analytics process, data scientists face challenges in accessing and sharing the right data. GreenplumChorus helps foster a complete data science ecosystem with best-of-breed analytics applications. As a social platform for collaborative data science, Greenplum Chorus users can increase productivity, decrease administrative burdens on IT infrastructures, and get better visibility and faster access to data through a single tool. EMC recently released the Greenplum Chorus source code under an Apache open source license through the  OpenChorus  Project. The OpenChorus Project will speed innovation and adoption of collaborative data science practices, helping organizations to drive greater business insight and economic value from Big Data. Making OpenChorus accessible to Data Scientists Greenplum and Kaggle joined forces to tackle the short supply and heavy demand for data scientists with an integration between the Kaggle data science c...

International Conference on Cloud and Big Data Analytics #ICCBDA 2013

Image
EMC Academic Alliance and PSG College of Technology ( Coimbatore, India ) will host the first " International Conference on Cloud and Big Data Analytics ( ICCBDA 2013) " during the first week of February 2013. The event will feature technical talks, paper presentation and various other contests.  Here are some of the key information to look fwd to -  Call for Technical paper is now open. Refer  http://www.psgtech.edu/iccbda2013  for more details. Deserving technical papers will not just be rewarded by EMC, but also published in  ICT Academy of Tamil Nadu  magazine Expert talks on sort after technology trends will be hosted as part of the conference.  Watch  https://www.facebook.com/EMCacademicalliance/events space for more details on the webinar series Follow  the details on   http://ictact.in/emc/index.html  Some important links to follow are -  https://twitter.com/EMCAcademics  #ICC...

Visualizations from The Human Face of Big Data

Image
Data visualization  is the study of the visual representation of data, meaning "information that has been abstracted in some schematic form, including attributes or variables for the units of information". The Human Face of Big Data   is a global, crowdsourced media project focusing on humanity's new ability to collect, analyze and visualize vast amounts of data in real time.   Data visualizations help paint a picture of how Big Data affects and measures our lives. These revealing new interactive data visualizations were created for The Human Face of Big Data project Mission Control. A team of designers and data scientists from EMC Cloud Services, EMC Greenplum, and Tableau Software analyzed one billion unique global tweets from Twitter, along with other data, to amplify themes and stories from the project. A data set of approximately 170 billion unique data elements was drawn from the billion tweets and loaded into an EMC Greenplum Data Computing A...

What is Big Data ?

Image
Big data  (also spelled Big Data) is a general term used to describe the voluminous amount of unstructured and semi-structured data a company creates -- data that would take too much time and cost too much money to load into a relational database for analysis. Although Big data doesn't refer to any specific quantity, the term is often used when speaking about petabytes and exabytes of data. To clarify matters, the three Vs of volume, velocity and variety are commonly used to characterize different aspects of big data. Big data is being enabled by inexpensive storage, a proliferation of sensor and data capture technology, increasing connections to information via the cloud and virtualized storage infrastructures, and innovative software and analysis tools. Big data is not a "thing" but instead a dynamic/activity that crosses many IT borders. IDC defines it this way:  Big data technologies describe a new generation of...

What is "The Human Face of Big Data" Project

Image
“ The Human Face of Big Data ,” the latest groundbreaking, globally crowdsourced initiative from Rick Smolan, the creator of the “Day in the Life” series. The project, made possible through primary sponsorship from EMC, is based on the premise that the real-time visualization of data collected by satellites, and by billions of sensors, RFID tags, and GPS-enabled cameras and smartphones around the world, is enabling humanity to sense, measure, understand and affect aspects of our existence in ways our ancestors could never have imagined in their wildest dreams.  The multifaceted project kicks off on September 25 with an eight-day “Measure Our World” event inviting people around the world to share and compare their lives in real time through an innovative smartphone app. The project also includes “Mission Control” events in New York, Singapore and London; “Data Detectives,” a global student initiative being conducted in conjunction with the TED organization; a stunning large-...

What makes a Data Scientist ?

Image
Some of the skill sets sort after, when looking for a Data Scientist are ( Source : Careers @ EMC )   Strong statistical foundation, with broad knowledge of deterministic and probabilistic statistical methods. Proficiency in at least one of the following statistical toolkits: SAS, R, SPSS, Matlab, Mahout/MADLib. Programming strength in at least one of the following languages: SQL, C/C++, Java, Python, Perl. Optional programming strength in the following Hadoop tools: MapReduce, Pig, Hive, Hbase. Natural ability to communicate basic quantitative concepts clearly. Natural curiosity to research and identify possible quantitative solutions to common business problems. Technical knowledge of distributed computing platforms, and common data process flows from data instrumentation & generation, to ETL, to the data warehouse itself. Advanced degree (PhD or Masters) in an analytical or technical field (e.g. applied mathematics, statistics, physics, computer science, operation...