If you want to upskill and get certified on Hadoop, you can now do so for free, thanks to MapR. Over the next couple of weeks they are rolling out their on-demand Hadoop training courses. The highlight of the first batch of courses is Developing Hadoop Applications on Yarn.
These days Hortonworks, with their IPO, and Cloudera, sitting on $1bn of cash, grab all the headlines. However, the real visionary in the field is someone else. Someone blasting the previous world record in TeraSort. A Hadoop distribution on both Amazon Web Services and the Google Compute Engine. A company that Google is invested in. While their competitors have been in skirmishes with each other, MapR has been quietly working away and innovating.
MapR-FS: Features and (more...)
My friend and colleague, ACE Director Brendan Tierney, has recently published the reference book Predictive Analytics Using Oracle Data Miner. It is the first comprehensive book on the subject matter. The book is primarily aimed at the Oracle Data Scientist/Data Miner. The other target audience is Oracle developers who implement the data mining models created by the Data Scientists in their applications, e.g. OBIEE. Some of the areas covered are also relevant for Oracle (more...)
There are various tell-tale signs that something is not quite right with your Oracle Data Integrator implementation. Does your ODI architecture suffer from the following symptoms?
- The ETL seems to take forever
- Some data flows take more than 30 minutes
- Your developers take ages to implement new data flows or change existing mappings
- The ETL breaks at least once a week
- No meaningful and consistent set of naming standards and coding conventions has been implemented
I will give a presentation on 24 September at the Jury’s Inn in Dublin on the next generation of Big Data 2.0 tools and architecture.
Over the last two years there have been significant changes and improvements in the various Big Data frameworks. With the release of YARN (Hadoop 2.0), the most popular of these platforms now allows you to run mixed workloads. Gone are the days when Hadoop was only good for (more...)
ODI and the Hadoop ecosystem share a common design philosophy: bring the processing to the data rather than the other way around. Sounds logical, doesn’t it? Why move terabytes of data around your network if you can process it all in one place? Why invest millions in additional servers and hardware just to transform and process your data?
In the ODI world this approach is known as ELT. ELT is a marketing concept (more...)
You may have wondered why we were quiet over the last couple of weeks. Well, we locked ourselves into the basement and did some research and a couple of projects and PoCs on Hadoop, Big Data, and distributed processing frameworks in general. We were also looking at Clickstream data and Web Analytics solutions. Over the next couple of weeks we will update our website with our new offerings, products, and services. The article below summarises (more...)