Structured vs. Unstructured Big Data Q&A

I was asked recently to define structured vs. unstructured data and to explain how the different types of data are being managed within the enterprise. I thought I’d list my responses below in case you find them useful or interesting. As ever, feedback and debate are welcome.

What are the challenges seen with the two different types of unstructured […]

It’s Not (Just) Size That Matters

This Thursday, July 19, I’m going to be participating in a DM Radio panel titled “How Big Is Big? Why Big Data Comes in Various Sizes.” This will be my third time participating in a DM Radio segment, and if it is anything like the last two, it should be an interesting discussion around the […]

Big Data and MDM – Where Quantity meets Quality

I presented a workshop at the MDM & Data Governance Summit in San Francisco this week on the topic of Big Data and Master Data Management (MDM). It was a particularly interesting topic for me because I have spent the last 8 years working as VP Product Marketing at Siperian (a leading MDM provider acquired […]

Getting Personal … Big Data from Facebook, Where is it all Stored?

As the Facebook IPO frenzy builds up to the pricing and Facebook starts trading this Friday (UPDATE: Facebook has priced at $38, giving it a market cap of about $104B), it got me thinking about how much data I have uploaded/contributed to Facebook over the last 5 years. Turns out, you can get your […]

How Much Is That Hadoop Cluster Really Costing You?

Also published at http://rainstor.com/how-much-is-that-hadoop-cluster-really-costing-you/

Last month when we released our RainStor for Big Data Analytics product edition that runs natively on Hadoop, we raised a lot of eyebrows with two of the points that we were making:

Compression can dramatically reduce the TCO of Hadoop nodes needed

SQL access to the compressed data in HDFS […]

Data Scientists and Big Data: Predicting Linsanity and Targeting Pregnant Teens

A couple of interesting articles ran through my #BigData tweetstream last week. Shout out to Ivan Chong (@ichong) at Informatica, whom I retweeted:

Can this guy be considered a Data Scientist? online.wsj.com/article/SB1000… “Delivery Guy Who Saw Jeremy Lin Coming” #bigdata #overlooked #linsanity

“The Delivery Guy Who Saw Jeremy Lin Coming” tells the intriguing story of a […]

RIP Enterprise Data Warehouses? Long Live the Logical Data Warehouse?

I just read the August 2011 Gartner report titled Does the 21st-Century “Big Data” Warehouse Mean the End of the Enterprise Data Warehouse? (Subscription Required).

Clearly, EDWs have their shortcomings, particularly in this new Big Data world. One of the reasons Hadoop and NoSQL repositories have gained in popularity is in part due to the […]

The Growing Universe of Big Data and Hadoop – A Graphical view of Funding, M&A and Partnerships (Updated May 2012)

Here is a quick graphic, last updated in May 2012, that visualizes some of the funding, M&A and partnerships related to the growing universe of Big Data and, of course, Hadoop.

Note that in certain categories there is a mix of companies and technologies. The graphic is a work in progress and does NOT […]

Big Data, Shared Nothing and Twin Babies

First of all, apologies for the lack of posts over the last month or so. I’ve been busy working on the launch of significant enhancements to the RainStor product and lots of exciting activity with our partners, including our recently announced relationship with Dell. My other focus has been my fast-growing identical twins Parker […]

Hadoop Compression – The Elephant That’s Not In The Room

This article was originally posted at Cloudtimes.org http://cloudtimes.org/hadoop-compression-the-elephant-thats-not-in-the-room/

We are living in the age of “Big Data,” where billions of transactions, events or activities are generated through the use of smartphones, web browsing, smart meter sensors and more. Hadoop, MapReduce and a new generation of NoSQL technologies are helping us manage, transform, analyze and deal with the […]