Check out the video of the US President talking about data science and the first Chief Data Scientist of the USA talking about his mission.
For those of you who missed the event I have posted some pictures below. We have recorded (more...)
While we could always write a CASE statement for each of the digits, i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9, 0, I found an easier way out by using the ASCII function.
ASCII function (more...)
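The idea of replacing ten separate branches with a single character-code test can be sketched as follows. This is only an illustration in Python, not the SQL from the post: the built-in `ord` stands in for the database ASCII function, and `is_ascii_digit` is a hypothetical helper name.

```python
def is_ascii_digit(ch: str) -> bool:
    """Return True if ch is a single character in the range '0'..'9'."""
    # The digits 0-9 occupy the contiguous ASCII codes 48-57, so one
    # range comparison replaces ten separate equality checks.
    return len(ch) == 1 and 48 <= ord(ch) <= 57


# Example: keep only the digit characters of a mixed string.
digits_only = "".join(c for c in "a1b22c" if is_ascii_digit(c))
```

The same range test carries over to any language that exposes character codes.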
In this blog post we look at how we can address a shortcoming in the Hive ALTER TABLE statement using parameters and variables in the Hive CLI (Hive 0.13 was used).
There’s a simple way to query Hive parameter values directly from the CLI.
You simply execute (without specifying the value to be set):
SET hive.exec.compress.output; --- hive.exec.compress.output=false
You may use those parameters directly in (more...)
Version 18.104.22.1689 supports the following Essbase versions:
This issue, which often manifested itself with errors in the Essbase error 10420xx range, was caused by how the Essbase Java API communicated with the server. In essence, whenever a piece of information was needed, the Essbase Java API grabbed a port from the pool of available ports, did its business, (more...)
Join MapR and Sonra for the Hadoop User Group Ireland Meetup on 23 February at 6 pm at the Wayra offices (O2/Three building). You’ll learn more about the MapR distribution for Apache Hadoop through use cases, case studies and an introduction to the benefits of using the MapR platform.
Come by for this content-packed first event ending with the opportunity to socialise over beer and pizza kindly provided by Sonra.
What is (more...)
A few days ago I recorded a 2-minute tech tip with Bob Hubbard of OTN.
My topic was on Predictive Queries which are a new feature in the Oracle 12c Database.
The challenge was to talk about the topic within 2 minutes. That is a lot harder than you think. Believe me.
Check out the video on Bob's OTN 2-Minute Tech Tip channel or click on the link below.
It was fun doing this (more...)
Here is the complete text of the article - Big changes ahead for India's IT majors:
Hidden among the noise surrounding the big three of the Indian IT industry - TCS, Wipro, and Infosys - was a very interesting sliver of signal that points to possibly big changes on the horizon. Though Cognizant should be counted among these (more...)
In this blog post I want to show you how you can go about evaluating your classification models that you develop using Oracle Data Miner (part of SQL Developer).
What I'm not going to show you here is how to develop classification models using (more...)
If you want to upskill and get certified on Hadoop you can now do so for free, thanks to MapR. Over the next couple of weeks they are rolling out their on-demand Hadoop training courses. The highlight of the first batch of courses is Developing Hadoop Applications on YARN.
When you are working on building classification models you will need some way of measuring the effectiveness of each model that you build. This measurement/evaluation is performed during the model build process.
Typically the model build process consists of 2 steps (I'm assuming all data preparation etc. has been completed):
- Build the model: During this step you will feed in a portion of your data set to the data mining algorithm. Typically this data (more...)
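A minimal sketch of the build-then-evaluate idea, in plain Python rather than Oracle Data Miner. It assumes the second step is testing the model on data held back from the build step. The function names, the 60% build fraction and the fixed seed are all illustrative assumptions.

```python
import random
from collections import Counter


def split_build_test(rows, build_fraction=0.6, seed=42):
    """Shuffle rows and split them into a build set and a test set."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * build_fraction)
    return shuffled[:cut], shuffled[cut:]


def confusion_counts(actual, predicted, positive):
    """Count true/false positives and negatives for one target class."""
    counts = Counter()
    for a, p in zip(actual, predicted):
        if p == positive:
            counts["tp" if a == positive else "fp"] += 1
        else:
            counts["fn" if a == positive else "tn"] += 1
    return counts


def accuracy(counts):
    """Fraction of test cases the model classified correctly."""
    total = sum(counts.values())
    return (counts["tp"] + counts["tn"]) / total if total else 0.0
```

The model is trained on the build set only. Its predictions on the test set are then passed to `confusion_counts`, and measures such as `accuracy` are read off the resulting counts.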
As the volume of the data in your tables grows, particularly in the big data world, you may run into some memory issues or package restrictions with pulling down the tables to your R environment.
Some of the R packages and drivers have some recommended numbers or limits for the number of records that can be fetched.
In the following example I'm looking at downloading a table with 300K records from an Oracle Database. I've (more...)
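One common way around fetch limits is to pull the rows down in fixed-size batches rather than all at once. The sketch below shows that pattern in Python, not the R code from the post. An in-memory SQLite table stands in for the Oracle table, and the table name, row count and batch size are illustrative assumptions.

```python
import sqlite3


def fetch_in_batches(cursor, batch_size=1000):
    """Yield lists of rows, at most batch_size rows at a time."""
    while True:
        rows = cursor.fetchmany(batch_size)
        if not rows:
            break
        yield rows


# Stand-in data: an in-memory SQLite table instead of the Oracle table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE demo (id INTEGER)")
conn.executemany("INSERT INTO demo VALUES (?)", [(i,) for i in range(2500)])

cur = conn.execute("SELECT id FROM demo")
batch_sizes = [len(batch) for batch in fetch_in_batches(cur, batch_size=1000)]
total_rows = sum(batch_sizes)
```

Each batch can be processed or appended to a result as it arrives, so only `batch_size` rows need to be held in memory per fetch.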
This is the full text of the article:
Using the data from pwcmoneytree.com and easy-to-use dashboard software we perform analytics on a huge dataset that spans 20 years of venture capital investment data from 1995 onward. Having data that goes this far back should give us enough to extract the necessary analytical juice out of it.
The year 2000 was definitely the peak for VC investment craziness. A whopping 105 billion was pumped into startups and bringing them (more...)
The following is not something new but something that I have put together this evening, and I mainly make it available as a note to myself of what I did. If you find it useful or interesting then you are more than welcome to use and share it. You will also find lots of similar solutions on the web.
This evening I was playing around with the Text Mining (tm) package in R. So I (more...)