LCG T-TEST USING EUCLIDEAN METHOD

Last update: Jan 21, 2022

Related tags

Text Data & NLP lcgeuclideanmethod

Overview

LCG T-TEST USING EUCLIDEAN METHOD

Advanced Analytics and Growth Marketing Telkomsel

Project Supervisor : Rizli Anshari, General Manager of AAGM Telkomsel
Writer : Azka Rohbiya Ramadani, Muhammad Gilang, Demi Lazuardi

This project has been created for statistical usage, purposing for determining ATL takers and nontakers using LCG ttest and Euclidean Method, especially for internal business case in Telkomsel.

Background

In offering digital product, bussiness analyst must have considered what is the most suitable criterias of customers who have potential to buy the product. As an illustration, targetting gamers for games product campaign is better decision rather than targetting random customers without knowing customers behaviour. However, bussiness wouldn't make all the gamers as the campaign target, otherwise, market team deliberately wouldn't offer to several gamers randomly as comparison, called as Control Group. Therefore, it enables the team in measuring how success the campaign is.

After the campaign, there are product takers who are expected comes from campaign target. Additionaly, nontakers group is expected comes from Control Group and non-target customers. The analysis problems emerge afterwards, since nontakers might come from outside target. For this reason, our team made statistical technique in python algorithm to determine Control Group by applying euclideanmethod-combined t-test, in order for comparing takers and nontakers behaviour before the campaign, in this case we're focusing on revenue before. As a result, it enables bussiness analyst evaluating how success the campaign is.

Installation Guide

This algorith has been uploaded to pypi.org. Therefore, in order to get package, you can easely download using the following command

pip3 install lcgeuclideanmethod

Requirements

This python requires related package more importantly python_requires='>=3.1', so that package can be install Make sure the other packages meet the requirements below

pandas>=1.1.5,
numpy>=1.18.5,
scipy>=1.2.0,
matplotlib>=3.1.0,
statsmodels>=0.8.0

Usage Guide

1. EuclideanMethod

Input:
- df_takers : dataframe takers containing two columns, customers and revenue before campaign
- df_nontakers : dataframe nontakers containing two columns, customers and revenue before campaign
Output:
- summary : containing the number of expected Control Group populations based on max-p value, and other general information like average, std, max, min, etc.
- df_result : subprocess table to find p-value from random nontakers
- df_tukey : main result containing category customers category, based on summary calculation
- tukey : tukey HSD evaluation, readmore Tukey HSD

sample code

from lcgttest.lcgttest import EuclideanMethod
import pandas as pd

# where you put takers and nontakers file
df_takers = pd.read_csv('takers.csv')
df_nontakers = pd.read_csv('nontakers.csv')

model = EuclideanMethod(df_takers, df_nontakers)
model.run()

# output
print(model.summary)
print(model.df_result)
print(model.df_tukey)
print(model.tukey)

2. MapEuclideanMethod

This is like map function in python

Input:
- arr_df_takers : dataframe takers but in array form
- arr_df_nontakers : dataframe nontakers but in array form
- labels : labels of both takers and nontakers in array form
Output:
- df_summary : containing the number of expected Control Group populations based on max-p value, and other general information like average, std, max, min, etc in dataframe form.
- dict_df_result : subprocess table to find p-value from random nontakers in dicttionary type.
- dict_df_tukey : main result containing category customers category, based on summary calculation in dicttionary type.
- dict_tukey : tukey HSD evaluation, readmore Tukey HSD in dicttionary type.

sample code

from lcgttest.lcgttest import MapEuclideanMethod
import pandas as pd
import numpy as np

# where you put takers and nontakers file
arr_df_takers = np.array([df_takers, df_takers2, df_takers3])
arr_df_takers = np.array([df_nontakers, df_nontakers2, df_nontakers3])
labels = ['campaignA','campaignB','campaignC']

model2 = MapEuclideanMethod(arr_df_takers, arr_df_nontakers, label = labels )

# output
print(model.df_summary)
print(model.dict_df_result)
print(model.dict_df_tukey)
print(model.dict_tukey)

3. EuclideanMethodAscDesc

This is run twice MapEuclideanMethod ascending and descending (technique to randomize the nontakers samples)

Input:
- arr_df_takers : dataframe takers but in array form
- arr_df_nontakers : dataframe nontakers but in array form
- labels : labels of both takers and nontakers in array form
Output:
- df_summary : containing the number of expected Control Group populations based on max-p value, and other general information like average, std, max, min, etc in dataframe form.
- dict_df_result : subprocess table to find p-value from random nontakers in dicttionary type.
- dict_df_tukey : main result containing category customers category, based on summary calculation in dicttionary type.
- dict_tukey : tukey HSD evaluation, readmore Tukey HSD in dicttionary type.

sample code

from lcgttest.lcgttest import EuclideanMethodAscDesc
import pandas as pd
import numpy as np

# where you put takers and nontakers file
arr_df_takers = np.array([df_takers, df_takers2, df_takers3])
arr_df_takers = np.array([df_nontakers, df_nontakers2, df_nontakers3])
labels = ['campaignA','campaignB','campaignC']

model3 = EuclideanMethodAscDesc(arr_df_takers, arr_df_nontakers, label = labels )

# output
print(model3.df_summary)
print(model3.dict_df_result)
print(model3.dict_df_tukey)
print(model3.dict_tukey)

# additional input
print(model3.df_asc_desc_avg)

You might also like...

test

Lidar-data-decode In this project, you can decode your lidar data frame(pcap file) and make your own datasets(test dataset) in Windows without any hug

46 Dec 5, 2022

a test times augmentation toolkit based on paddle2.0.

Patta Image Test Time Augmentation with Paddle2.0! Input | # input batch of images / / /|\ \ \ # apply

110 Dec 3, 2022

Code for the paper TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks

TestRank in Pytorch Code for the paper TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks by Yu Li, Min Li, Qiuxia Lai, Ya

3 May 19, 2022

Installation, test and evaluation of Scribosermo speech-to-text engine

Scribosermo STT Setup Scribosermo is a LGPL licensed, open-source speech recognition engine to "Train fast Speech-to-Text networks in different langua

3 Jun 20, 2022

One Stop Anomaly Shop: Anomaly detection using two-phase approach: (a) pre-labeling using statistics, Natural Language Processing and static rules; (b) anomaly scoring using supervised and unsupervised machine learning.

One Stop Anomaly Shop (OSAS) Quick start guide Step 1: Get/build the docker image Option 1: Use precompiled image (might not reflect latest changes):

148 Dec 26, 2022

This repository details the steps in creating a Part of Speech tagger using Trigram Hidden Markov Models and the Viterbi Algorithm without using external libraries.

POS-Tagger This repository details the creation of a Part-of-Speech tagger using Trigram Hidden Markov Models to predict word tags in a word sequence.

1 Dec 9, 2021

Creating an Audiobook (mp3 file) using a Ebook (epub) using BeautifulSoup and Google Text to Speech

epub2audiobook Creating an Audiobook (mp3 file) using a Ebook (epub) using BeautifulSoup and Google Text to Speech Input examples qual a pasta do seu

7 Aug 25, 2022

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

Text-Summarization-using-NLP Text Summarization using NLP to fetch BBC News Arti

21 Aug 6, 2022

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

NeuroNER NeuroNER is a program that performs named-entity recognition (NER). Website: neuroner.com. This page gives step-by-step instructions to insta

1.6k Dec 27, 2022

LCG T-TEST USING EUCLIDEAN METHOD

Related tags

Overview

LCG T-TEST USING EUCLIDEAN METHOD

Advanced Analytics and Growth Marketing Telkomsel

Background

Installation Guide

Requirements

Usage Guide

1. EuclideanMethod

2. MapEuclideanMethod

3. EuclideanMethodAscDesc

You might also like...

test

a test times augmentation toolkit based on paddle2.0.

Code for the paper TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks

Installation, test and evaluation of Scribosermo speech-to-text engine

One Stop Anomaly Shop: Anomaly detection using two-phase approach: (a) pre-labeling using statistics, Natural Language Processing and static rules; (b) anomaly scoring using supervised and unsupervised machine learning.

This repository details the steps in creating a Part of Speech tagger using Trigram Hidden Markov Models and the Viterbi Algorithm without using external libraries.

Creating an Audiobook (mp3 file) using a Ebook (epub) using BeautifulSoup and Google Text to Speech

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

Owner

A method for cleaning and classifying text using transformers.

A method to generate speech across multiple speakers

ThinkTwice: A Two-Stage Method for Long-Text Machine Reading Comprehension

Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"

A CRM department in a local bank works on classify their lost customers with their past datas. So they want predict with these method that average loss balance and passive duration for future.

A NLP program: tokenize method, PoS Tagging with deep learning

Code for text augmentation method leveraging large-scale language models

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Ceaser-Cipher - The Caesar Cipher technique is one of the earliest and simplest method of encryption technique

Seq2seq attn - Use the Seq2Seq method to implement machine translation and introduce Attention mechanism to improve the results