Blog

A data science blog for Petroleum Engineering.

Topics covered

Subject ▸ machine learning

Using application microprocessors for seismic

24 October 2019

seismic, deep learning, machine learning

First practical application that I know of using the next thing after TPUs (Tensor Processing Units): ASICs or Application Specific Integrated Circuit. Ideal dedicated hardware for the massive seismic data and processing. Machine Learning (ML) algorithms build a mathematical model based upon representative sample data, known as ‘training data’, in order to make predictions or decisions without being explicitly programmed to perform the task. I limit my discussion here to supervised learning in the context of a potential application to seismic data image processing of a real marine seismic dataset, and then discuss how the computational scale of such exercises reinforce the need to develop computing technology that is customized for large ML problems.

Any Petroleum Engineer can do reproducible Machine Learning

22 September 2019

machine learning, petroleum engineering

As I prepare to release couple of examples using Generative Adversarial Networks (GANs) for creating synthetic datasets using rTorch, I found that I have several Rmarkdown notebooks loose out there while learning PyTorch. So, I decided to put these notebooks in a sort of an online ebook in GitHub. These notebooks range from unit tests for testing functions I implemented in *rTorch*, a wrapper of PyTorch, written in R, to small neural networks for logistic regression and linear regression.

A Generative Adversarial Networks (GAN) in rTorch for creating synthetic datasets. Season 1 Episode 1

29 August 2019

datasets, machine learning

Introduction I recently finished double-converting, from Python to R, and from PyTorch to rTorch, a very powerful script that creates synthetic datasets from real datasets. Synthetic datasets are based on real data but altered enough to keep the privacy, confidentiality or anonymity, of the original data. Since synthetic datasets conserve the original statistical characteristics of the real data, this could represent a breakthrough in petroleum engineering for sharing data, as the oil and gas industry is well known for the secrecy of its data.

R and Python commingled: RPyStats in Windows and Linux. Season 1, Episode 6

18 July 2019

computer science, machine learning

Alright! We have two Rmarkdown notebooks written in Python and R running PyTorch libraries. There are many more things that still we can do to improve the accuracy of the model to recognize hand-written digits. But before we continue improving the algorithm and the model, I wanted to make a brief pause and showed you something that really made me jump ship from Windows to Linux. It is related to R, Python and data science.

R and Python commingled: Creating a PyTorch project with RPyStats. Season 1, Episode 5

17 July 2019

computer science, machine learning

In the previous episode we ended up calculating the accuracies for the MNIST digits model using PyTorch libraries called from a Rmarkdown notebook written in Python. In this episode, we will run the same example but in another notebook written in R named *mnist_digits_rstats.Rmd*, which was saved in the folder ./work/notebooks in the previous session. With RStudio open, let’s click on the file to open it. You may notice some familiarity between the code in this notebook *mnist_digits_rstats.

R and Python commingled: Creating a PyTorch project with RPyStats. Season 1, Episode 4

16 July 2019

computer science, machine learning

Introduction Now that you have your machine with the core applications installed (Season1, Episode 3), in this episode we will cover the steps to create a machine learning project commingling R and Python. I will call this project rpystats-apollo11. What rpystats-apollo11 will do is training a neural network using the MNIST dataset to recognize hand-written digits. I will write the code in pure Python first, and then in R in two separate Rmarkdown notebooks running in RStudio.

Hinton's Coursera course Neural Networks for Machine Learning

16 January 2019

machine learning, courses

The machine learning course is little heavy in math but I would recommend to, at least, taking a look at some of the lectures to find out what AI and ML are about. You can browse as much as you want, for free, in Coursera.

Machine Learning for lithofacies classification from well logs by Paolo Dell'Aversana

16 January 2019

machine learning

Don’t miss reading this paper on an application of machine learning in petroleum engineering by Paolo Dell’Aversana.

References

Reinforcement Learning. Not an easy subject

2 January 2019

machine learning

Not an easy subject nor as common as supervised and unsupervised learning but key for making better intelligent machines.

Book by Andrew Barto and Richard S. Sutton: https://lnkd.in/ewb_MeJ

Link to file

#petroleumengineering #machinelearning #digitalpetroleum #datascience

Is Windows in the comfort zone?

29 December 2018

machine learning, linux, reproducibility

There is also a very controversial issue in this article such as Windows ecosystem is the comfort zone of developers and people alike. If you are serious about Application Development in Data Science, Machine Learning and Artificial Intelligence -, then you have to be get out of the comfort zone and start doing it in Linux. The terminal means reproducibility. Start with practicing Linux an hour a day. No?

All Blog posts by date