New in Big Data Discovery 1.2 is the addition of BDD Shell, an integration point with Python. This exposes the datasets and BDD functionality in a Python and PySpark environment, opening up huge possibilities for advanced data science work on BDD datasets. With the ability to push back to Hive and thus BDD data modified in this environment, this is important functionality that will make BDD even more useful for navigating and exploring (more...)
New in Big Data Discovery 1.2 is the addition of BDD Shell, an integration point with Python. This exposes the datasets and BDD functionality in a Python and PySpark environment, opening up huge possibilities for advanced data science work on BDD datasets, particularly when used in conjunction with Jupyter Notebooks. With the ability to push back to Hive and thus BDD data modified in this environment, this is important functionality that will make BDD (more...)
It’s time to announce the 3rd episode of Gluent New World webinar series! This time Gwen Shapira will talk about Kafka as a key data infrastructure component of a modern enterprise. And I will ask questions from a old database guy’s viewpoint :)
Apache Kafka and Real Time Stream Processing
Amazon Web Services (AWS) recently released a product called AWS Data Migration Services (DMS) to migrate data between databases.
I have used AWS DMS to try a migration from a source MySQL database to a target MySQL database, a homogeneous database migration.
The DMS service lets you use a resource in the middle Replication Instance - an automatically created EC2 instance - plus source and target Endpoints. Then you move data from the source (more...)
This is the next post in my series about Oracle GoldenGate Big Data adapters. Here is list of all posts in the series:
- GoldenGate 12.2 Big Data Adapters: part 1 – HDFS
- GoldenGate 12.2 Big Data Adapters: part 2 – Flume
- GoldenGate 12.2 Big Data Adapters: part 3 – Kafka
- GoldenGate 12.2 Big Data Adapters: part 4 – HBASE
In this post I am going to explore HBASE adapter for GoldenGate. (more...)
Data is exploding and Exadata is catching up. With the proliferation of cloud technology and in-memory databases; Oracle Exadata X6-2 and X6-8 has it all. It seems to be an ideal platform for hyper-convergence for any data center running Oracle products.
Following are some of the salient features of Oracle X6:
- The compute nodes have twenty two-core Intel Xeon E5-2699 v4 processors
- The memory is DDR4 and of size 256Gb and it can be expanded (more...)
It’s time to announce the 2nd episode of the Gluent New World webinar series!
The Gluent New World webinar series is about modern data management: architectural trends in enterprise IT and technical fundamentals behind them.
GNW02: SQL-on-Hadoop : A bit of History, Current State-of-the-Art, and Looking towards the Future
- This GNW episode is presented by no other than Mark Rittman, the co-founder & CTO of Rittman Mead and an all-around guru of enterprise BI!
Although we are still in stealth mode (kind-of), due to the overwhelming requests for information, we decided to publish a video about what we do :)
It’s a short 5-minute video, just click on the image below or go straight to http://gluent.com:
And this, by the way, is just the beginning.
Gluent is getting close to 20 people now, distributed teams in US and UK – and we are still hiring!
Hi, it took a bit longer than I had planned, but here’s the first Gluent New World webinar recording!
You can also subscribe to our new Vimeo channel here – I will announce the next event with another great speaker soon ;-)
A few comments:
- Slides are here
- I’ll figure a good way to deal with offline follow-up Q&A later on, after we’ve done a few of these events
If you like this (more...)
In this blog post I will show you the configurations that are necessary for RStudio Server to work with Oracle R Enterprise on your Oracle Database server. In theory if you have just installed ORE and then RStudio Server, everything should work, but if you encounter any issues then check out the following.
Before I get started make sure to check out my previous blog posts on installing R Studio Server. The first blog post (more...)
In a previous blog post I showed how you can install and get started with using RStudio on a server by using RStudio Server. My previous post showed how you could do that on the Oracle BigDataLite VM. On this VM everything was nicely scripted and set up for you. But when it comes to installing it on a different server, well things can be a bit different.
The purpose of this blog post is (more...)