Finding Lane Lines on the Road

In this project, the aim is to detect lane lines in images using Python and OpenCV. OpenCV stands for "Open-Source Computer Vision" and is a package with many useful tools for analyzing images.

First, we need to import some packages.

### importing some useful packages
import matplotlib.pyplot as plt
import matplotlib …
more ...

Association Matrix for Character Variables - SAS implementation

This program generates a correlation (association) matrix using the Cramer's V statistic for character variables.
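As a rough illustration of the statistic itself, here is a minimal Python sketch (not the SAS program from the post). It computes Cramer's V from the chi-square statistic of the two-way frequency table, V = sqrt(chi2 / (n * (k - 1))), where k is the smaller of the two level counts. The function names and the tiny example columns are hypothetical and do not come from claim_history.

```python
from collections import Counter
from math import sqrt


def cramers_v(x, y):
    """Cramer's V between two equal-length sequences of category labels."""
    n = len(x)
    if n == 0 or n != len(y):
        raise ValueError("x and y must be non-empty and of equal length")
    row_totals = Counter(x)
    col_totals = Counter(y)
    observed = Counter(zip(x, y))
    k = min(len(row_totals), len(col_totals))
    if k < 2:
        # A constant variable has no association to measure;
        # returning 0.0 is a convention chosen for this sketch.
        return 0.0
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            chi2 += (observed.get((r, c), 0) - expected) ** 2 / expected
    return sqrt(chi2 / (n * (k - 1)))


def association_matrix(columns):
    """Pairwise Cramer's V for a dict of {name: list of labels}."""
    names = list(columns)
    return {a: {b: cramers_v(columns[a], columns[b]) for b in names}
            for a in names}


# Hypothetical toy data: col_a and col_b move together, col_c is unrelated.
toy = {
    "col_a": ["M", "F", "M", "F"],
    "col_b": ["Private", "Commercial", "Private", "Commercial"],
    "col_c": ["u", "u", "v", "v"],
}
matrix = association_matrix(toy)
```

In this toy table, `col_a` and `col_b` are perfectly associated (V = 1) and `col_a` and `col_c` are independent (V = 0).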

Illustration:

  • Dataset used in this code: claim_history.sas7bdat or claim_history.csv
  • The claim_history.sas7bdat dataset comes from the help library of SAS Enterprise Miner version 7.1
  • Data description: 10,302 observations, 27 variables

Note: the NOSOURCE …

more ...

Stepwise Regression: Forward Selection - SAS Implementation

Purpose:

The macro performs an automated forward selection and stepwise variable selection process for PROC GENMOD. Note that the GENMOD procedure in SAS versions prior to 9.4 does not come with model selection options.
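To make the idea of forward selection concrete, here is a small Python sketch. It is not the SAS macro. It assumes an ordinary least-squares model and uses AIC as the entry criterion, both of which are assumptions made for illustration; the macro's actual criterion is not stated in this excerpt. The data and variable names (`x0`, `x1`, `x2`) are synthetic.

```python
import numpy as np


def gaussian_aic(design, y):
    """AIC (up to an additive constant) of a least-squares fit."""
    n = len(y)
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    rss = float(np.sum((y - design @ beta) ** 2))
    n_params = design.shape[1] + 1  # coefficients plus error variance
    return n * np.log(rss / n) + 2 * n_params


def build_design(X, cols):
    """Intercept column followed by the chosen predictor columns."""
    n = X.shape[0]
    return np.column_stack([np.ones(n)] + [X[:, j] for j in cols])


def forward_select(X, y, names):
    """Add one predictor at a time while AIC keeps improving."""
    selected = []
    remaining = list(range(X.shape[1]))
    best = gaussian_aic(build_design(X, selected), y)
    while remaining:
        scores = [(gaussian_aic(build_design(X, selected + [c]), y), c)
                  for c in remaining]
        score, candidate = min(scores)
        if score >= best:
            break  # no candidate improves the criterion
        best = score
        selected.append(candidate)
        remaining.remove(candidate)
    return [names[j] for j in selected]


# Synthetic data: y depends on x0 and x2 only.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 3))
y = 1.0 + 2.0 * X[:, 0] - 1.5 * X[:, 2] + rng.normal(scale=0.5, size=300)
chosen = forward_select(X, y, ["x0", "x1", "x2"])
```

The strongest predictor enters first, and the loop stops as soon as no remaining variable lowers the criterion.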

Introduction:

SAS users of SAS 9.2 …

more ...

Stepwise Regression: Backwards Elimination - SAS Implementation

Purpose:

I created the following SAS code to implement the stepwise regression algorithm. This SAS macro performs an automated backward elimination variable selection process for PROC GENMOD, which does not come with model selection options. Note that the GENMOD procedure in SAS versions prior to 9 …

more ...

Pure premium regression model using XGBoost and random forest

Introduction

The most common types of ratemaking models in the insurance industry are frequency, severity, and pure premium models. Frequency models predict how often claims are made, and severity models predict claim amounts. The term pure premium is unique to insurance; it is the portion of the company’s expected …
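The relationship between the three quantities can be shown with a little arithmetic. The sketch below uses the standard definitions: frequency is claims per unit of exposure, severity is average loss per claim, and pure premium is loss per unit of exposure, so pure premium equals frequency times severity. The function name and the portfolio figures are hypothetical.

```python
def ratemaking_summary(claim_count, total_loss, exposure):
    """Return (frequency, severity, pure_premium) for one rating cell."""
    if exposure <= 0:
        raise ValueError("exposure must be positive")
    frequency = claim_count / exposure  # claims per unit of exposure
    # Average loss per claim; defined as 0.0 here when there are no claims.
    severity = total_loss / claim_count if claim_count else 0.0
    pure_premium = total_loss / exposure  # loss per unit of exposure
    return frequency, severity, pure_premium


# Hypothetical cell: 50 claims, 200,000 in losses, 1,000 car-years of exposure.
frequency, severity, pure_premium = ratemaking_summary(50, 200_000.0, 1_000.0)
```

With these made-up figures the frequency is 0.05 claims per car-year, the severity is 4,000 per claim, and the pure premium is 200 per car-year.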

more ...

Optimal Binning for Interval Variables

Auxiliary Uses of Decision Trees:

Optimal Binning of Continuous Variables Using Decision Trees in SAS E-Guide

Introduction

An interval (continuous) variable, such as operator age or vehicle length, can take an infinite number of values. These continuous variables also very frequently have missing observations, outliers, repeated observations, etc. that …

more ...

Optimal Binning for Nominal Variables

Auxiliary Uses of Decision Trees:

Optimal Binning of Nominal Variables with Many Levels Using Decision Trees in SAS E-Guide

Introduction

A persistent issue in data mining is dealing with high-dimensional data. Paradoxically, adding more data (in terms of the number of variables) does not necessarily improve the ability to develop …

more ...

Bootstrapping Inference in Regression

Summary

The purpose of this notebook is to illustrate how to bootstrap a confidence interval for the slope of the MATH independent variable in a multivariable regression setting using the GENMOD procedure in SAS.
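For readers who want to see the mechanics outside SAS, here is a minimal Python sketch of a percentile bootstrap for one slope. It is not the notebook's GENMOD code: it fits ordinary least squares with NumPy, resamples whole rows (case resampling), and runs on synthetic data. The function name, the second predictor, and all numeric settings are assumptions made for illustration.

```python
import numpy as np


def bootstrap_slope_ci(X, y, col, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the slope of predictor column `col`."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(y)
    design = np.column_stack([np.ones(n), X])  # intercept + predictors
    slopes = np.empty(n_boot)
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample rows with replacement
        beta, *_ = np.linalg.lstsq(design[idx], y[idx], rcond=None)
        slopes[b] = beta[col + 1]  # +1 skips the intercept
    lower, upper = np.quantile(slopes, [alpha / 2, 1 - alpha / 2])
    return float(lower), float(upper)


# Synthetic data: column 0 plays the role of MATH (true slope 0.5),
# column 1 is a second, hypothetical predictor.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 2))
y = 2.0 + 0.5 * X[:, 0] - 0.3 * X[:, 1] + rng.normal(scale=0.5, size=200)
lower, upper = bootstrap_slope_ci(X, y, col=0)
```

Resampling rows rather than residuals keeps each observation's predictors and response together, which is a simple default when the error structure is not known in advance.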

This technique is useful in an inference problem where the interpretation of the Beta parameter is critical …

more ...