Posts by Tag

R Projects

Graph

26 minute read

Extra and last post! Graph netowrk: a brief analysis. Privacy policy for Uber and Lyft Conclusions 1. Graph netowrk: a brief analysis. library(kni...

Xgb

11 minute read

XGboost To conclude with the estimation part of the project we will use the Xgboost methodology to compare results with the previous estimations. Once again...

Gbm

17 minute read

Gradient Boosting Machine (GBM) Whereas random forests build an ensemble of deep independent trees, GBMs build an ensemble of shallow and weak successive tr...

Rf

16 minute read

Random Forest in R Due to computational restrictions, this time we select randomly 260.000 observations from the original data, although this sub-sample re...

Trees

6 minute read

Tree classifiers in R library(data.table) library(tree) library(readr) NY <- read_delim("C:/Users/D/Desktop/NY.csv", ";", escape_double = FALSE, trim_ws...

Pcr&pls

5 minute read

Performing Principal Components Regression (PCR) and Partial Least Squares Regression (PLS) in R For UBER’s dataset library(tidyverse) library(caret) libra...

Load_clean

5 minute read

Part 1. Cleaning the database R code block: library(readr) NY <- read_delim("C:/Users/DDD/Desktop/NY.csv", ";", escape_double = FALSE, ...

Back to Top ↑

Python Projects

PCR & PLS

7 minute read

Regression, Data Science, Python

Clean_and _eda

6 minute read

Uber dataset in Rusia with Python This idea is based on replying the exercises from the previous post, but this time I prefered in a different database, thi...

Neural

5 minute read

Artificial Neural networks on python In this exercise we simulate the Artificial Neural Networks (ANN) in class, this is an easy example and it is applied i...

Back to Top ↑

Statistics

Fluctuations in coin tossing

11 minute read

Fluctuations in coin tossing The ideal coin-tossing game will be described in the terminology of random walks which is better suited for generalizations. Fo...

Combinatorial analysys

30 minute read

Combinatorial analysis Probability Theory: An Introduction “What are the chances…” is an expression you probably use very often. Determining the chances of...

Tes

1 minute read

test for jupyter def trapezoidal(f, a, b, n): h = float(b - a) / n s = 0.0 s += f(a)/2.0 for i in range(1, n): s += f(a + i*h) s...

Back to Top ↑

Regression

PCR & PLS

7 minute read

Regression, Data Science, Python

Back to Top ↑

Machine Learning

Toptal challenge

11 minute read

Pandas and classification exercise import numpy as np import pandas as pd from sklearn.tree import DecisionTreeClassifier from sklearn.model_selection impor...

Back to Top ↑

data wrangling

Back to Top ↑

data science

Back to Top ↑

messy data

Back to Top ↑

Python

Back to Top ↑