💬 Python scripts to parse Messenger, Hangouts, WhatsApp and Telegram chat logs into DataFrames.

Florian

Last update: Jan 2, 2023

Related tags

Data Analysis cloud parse telegram extract telegram-api plot histogram google-hangouts facebook-messenger wordcloud whatsapp takeout parsers whatsapp-parser hangouts-logs

Overview

Chatistics

Python 3 scripts to convert chat logs from various messaging platforms into Pandas DataFrames. Can also generate histograms and word clouds from the chat logs.

Changelog

10 Jan 2020: UPDATED ALL THE THINGS! Thanks to mar-muel and manueth, pretty much everything has been updated and improved, and WhatsApp is now supported!

21 Oct 2018: Updated Facebook Messenger and Google Hangouts parsers to make them work with the new exported file formats.

9 Feb 2018: Telegram support added thanks to bmwant.

24 Oct 2016: Initial release supporting Facebook Messenger and Google Hangouts.

Support Matrix

Platform	Direct Chat	Group Chat
Facebook Messenger	✔	✘
Google Hangouts	✔	✘
Telegram	✔	✘
WhatsApp	✔	✔

Exported data

Data exported for each message regardless of the platform:

Column	Content
timestamp	UNIX timestamp (in seconds)
conversationId	A conversation ID, unique by platform
conversationWithName	Name of the other people in a direct conversation, or name of the group conversation
senderName	Name of the sender
outgoing	Boolean value whether the message is outgoing/coming from owner
text	Text of the message
language	Language of the conversation as inferred by langdetect
platform	Platform (see support matrix above)

Exporting your chat logs

1. Download your chat logs

Google Hangouts

Warning: Google Hangouts archives can take a long time to be ready for download - up to one hour in our experience.

Go to Google Takeout: https://takeout.google.com/settings/takeout
Request an archive containing your Hangouts chat logs
Download the archive, then extract the file called Hangouts.json
Move it to ./raw_data/hangouts/

Facebook Messenger

Warning: Facebook archives can take a very long time to be ready for download - up to 12 hours! They can weight several gigabytes. Start with an archive containing just a few months of data if you want to quickly get started, this shouldn't take more than a few minutes to complete.

Go to the page "Your Facebook Information": https://www.facebook.com/settings?tab=your_facebook_information
Click on "Download Your Information"
Select the date range you want. The format must be JSON. Media won't be used, so you can set the quality to "Low" to speed things up.
Click on "Deselect All", then scroll down to select "Messages" only
Click on "Create File" at the top of the list. It will take Facebook a while to generate your archive.
Once the archive is ready, download and extract it, then move the content of the messages folder into ./raw_data/messenger/

Unfortunately, WhatsApp only lets you export your conversations from your phone and one by one.

On your phone, open the chat conversation you want to export
On Android, tap on ⋮ > More > Export chat. On iOS, tap on the interlocutor's name > Export chat
Choose "Without Media"
Send chat to yourself eg via Email
Unpack the archive and add the individual .txt files to the folder ./raw_data/whatsapp/

The Telegram API works differently: you will first need to setup Chatistics, then query your chat logs programmatically. This process is documented below. Exporting Telegram chat logs is very fast.

2. Setup Chatistics

First, install the required Python packages using conda:

conda env create -f environment.yml
conda activate chatistics

You can now parse the messages by using the command python parse.py.

By default the parsers will try to infer your own name (i.e. your username) from the data. If this fails you can provide your own name to the parser by providing the --own-name argument. The name should match your name exactly as used on that chat platform.

# Google Hangouts
python parse.py hangouts

# Facebook Messenger
python parse.py messenger

# WhatsApp
python parse.py whatsapp

Create your Telegram application to access chat logs (instructions). You will need api_id and api_hash which we will now set as environment variables.
Run cp secrets.sh.example secrets.sh and fill in the values for the environment variables TELEGRAM_API_ID, TELEGRAMP_API_HASH and TELEGRAM_PHONE (your phone number including country code).
Run source secrets.sh
Execute the parser script using python parse.py telegram

The pickle files will now be ready for analysis in the data folder!

For more options use the -h argument on the parsers (e.g. python parse.py telegram --help).

3. All done! Play with your data

Chatistics can print the chat logs as raw text. It can also create histograms, showing how many messages each interlocutor sent, or generate word clouds based on word density and a base image.

Export

You can view the data in stdout (default) or export it to csv, json, or as a Dataframe pickle.

python export.py

You can use the same filter options as described above in combination with an output format option:

  -f {stdout,json,csv,pkl}, --format {stdout,json,csv,pkl}
                        Output format (default: stdout)

Histograms

Plot all messages with:

python visualize.py breakdown

Among other options you can filter messages as needed (also see python visualize.py breakdown --help):

  --platforms {telegram,whatsapp,messenger,hangouts}
                        Use data only from certain platforms (default: ['telegram', 'whatsapp', 'messenger', 'hangouts'])
  --filter-conversation
                        Limit by conversations with this person/group (default: [])
  --filter-sender
                        Limit to messages sent by this person/group (default: [])
  --remove-conversation
                        Remove messages by these senders/groups (default: [])
  --remove-sender
                        Remove all messages by this sender (default: [])
  --contains-keyword
                        Filter by messages which contain certain keywords (default: [])
  --outgoing-only       
                        Limit by outgoing messages (default: False)
  --incoming-only       
                        Limit by incoming messages (default: False)

Eg to see all the messages sent between you and Jane Doe:

python visualize.py breakdown --filter-conversation "Jane Doe"

To see the messages sent to you by the top 10 people with whom you talk the most:

python visualize.py breakdown -n 10 --incoming-only

You can also plot the conversation densities using the --as-density flag.

Word Cloud

You will need a mask file to render the word cloud. The white bits of the image will be left empty, the rest will be filled with words using the color of the image. See the WordCloud library documentation for more information.

python visualize.py cloud -m raw_outlines/users.jpg

You can filter which messages to use using the same flags as with histograms.

Development

Install dev environment using

conda env create -f environment_dev.yml

Run tests from project root using

python -m pytest

Improvement ideas

Parsers for more chat platforms: Discord? Signal? Pidgin? ...
Handle group chats on more platforms.
See open issues for more ideas.

Pull requests are welcome!

Social medias

Projects using Chatistics

Meet your Artificial Self: Generate text that sounds like you workshop

Credits

Main contributors: @MasterScrat, @mar-muel, @manueth, @bmwant
Word cloud generated using https://github.com/amueller/word_cloud
Stopwords from https://github.com/6/stopwords-json
Code under MIT license

Comments

= 1" when parsing Messenger chat logs">

"ValueError: ordinal must be >= 1" when parsing Messenger chat logs

I still haven't had success yet with parsing the FB messages. One issues was that timestamp = pd.to_datetime(content, format='%A, %B %d, %Y at %H:%M%p', exact=False).strftime("%s") had an invalid format string. I changed it to "%S" since I assume that's what you meant. However, I now get the following issue when running analyse.py.

Traceback (most recent call last):
  File "analyse.py", line 90, in <module>
    print(plot)
  File "C:\Users\asros\Anaconda3\envs\py2\lib\site-packages\ggplot\ggplot.py", line 116, in __repr__
    self.make()
  File "C:\Users\asros\Anaconda3\envs\py2\lib\site-packages\ggplot\ggplot.py", line 641, in make
    self.apply_axis_labels()
  File "C:\Users\asros\Anaconda3\envs\py2\lib\site-packages\ggplot\ggplot.py", line 231, in apply_axis_labels
    labels.append(self.xtick_formatter(label_text))
  File "C:\Users\asros\Anaconda3\envs\py2\lib\site-packages\matplotlib\dates.py", line 542, in __call__
    dt = num2date(x, self.tz)
  File "C:\Users\asros\Anaconda3\envs\py2\lib\site-packages\matplotlib\dates.py", line 445, in num2date
    return _from_ordinalf(x, tz)
  File "C:\Users\asros\Anaconda3\envs\py2\lib\site-packages\matplotlib\dates.py", line 260, in _from_ordinalf
    dt = datetime.datetime.fromordinal(ix).replace(tzinfo=UTC)
ValueError: ordinal must be >= 1

bug

opened by arosen93 12

PCDATA invalid Char value

Had this issue when running the parser and to solve is just add proper encoding on line 36 if anyone happens to face the same problem:

archive = etree.parse(filePath + "/" + filename,
                            etree.XMLParser(encoding='utf-8',
                                            ns_clean=True,
                                            recover=True))

should submit a PR?

opened by Carmezim 9

Module dependencies

Hey there,

So I've cloned the project a few times before, and even though I've gotten it to work, Chatistics doesn't work out of the box for me. My environment variables were already set, but LXML was giving me trouble, as where some dependencies that aren't listed in the requirements.txt. I noticed that there's a section for libxml in the README, but I just wanted to ask if you guys were taking any pull requests to add some fallbacks in case certain modules aren't already installed. Thanks!

opened by farisachugthai 6
Problem activating Chatistics shell
I had it working on the 17th of January but I updated the master branch today and now conda activate chatistics fails with:

CommandNotFoundError: Your shell has not been properly configured to use 'conda activate'. To initialize your shell, run $ conda init <SHELL_NAME> Currently supported shells are: - bash - fish - tcsh - xonsh - zsh - powershell See 'conda init --help' for more information and options. IMPORTANT: You may need to close and restart your shell after running 'conda init'.

Before trying this I deleted the existing environment out of ~/anaconda/envs/ and ran conda env create -f environment.yml

I also updated to Catalina in the meantime if that makes any difference.

Any ideas as to how I can get this working?

Edited to remove env from conda activate chatistics.
opened by willem-h 5
Add Telegram support
Rationale

Addressing issue #12

Important

Relies on telethon, therefore codebase was migrated to Python 3. Anyway, IMO we should develop in Python 3.

Work with telegram requires registering your own app to get an access token

Added logging

Bump some requirements

A lot of refactor and code style fixes, sorry :guardsman:
opened by bmwant 4
ValueError: time data 'Sunday, 11 March 2012 at 15:11 UTC+01' does not match format '%A, %B %d, %Y at %H:%M%p' (search)

I get a problem when trying to parse the facebook messenger files. I get File "parse_messenger.py", line 76, in <module> timestamp = pd.to_datetime(content, format='%A, %B %d, %Y at %H:%M%p', exact=False).strftime("%s")

and

'ValueError: time data 'Sunday, 11 March 2012 at 15:11 UTC+01' does not match format '%A, %B %d, %Y at %H:%M%p' (search)

Is this perhaps due to different time stamps formats or that they are older?

opened by edvinli 4
Fix whatsapp parsing

infer_datetime_regex withing whatsapp.py handles single digit month day values in the whatsapp timestamp incorrectly. Causing the parser to ignore a huge amount of messages. This error depends on the first time value encountered.

I'm not quite sure why the regex isn't a constant, but this pull request should fix the problem while maintaining the flexibility of the original function.

Before: After:

opened by Denpeer 3
ModuleNotFoundError

Having ModuleNotFoundError while running the command

python parsers/messenger.py --own-name "John Doe"

There's __ init __.py in parsers subdirectory, still facing this issue.

Traceback (most recent call last): File "parsers/messenger.py", line 11, in <module> from parsers import log ModuleNotFoundError: No module named 'parsers'
bug enhancement

opened by yatharth96 3

ValueError with datetime format

I am having the following issue with Pandas trying to read the datetime. Do you know why it's having trouble reading the full datetime?

Traceback

105 Person Name (group? False )
Traceback (most recent call last):
  File "parse_messenger.py", line 71, in <module>
    timestamp = pd.to_datetime(content[:-7], format='%A, %B %d, %Y at %H:%M%p').strftime("%s")
  File "C:\Users\asros\Anaconda3\envs\py2\lib\site-packages\pandas\core\tools\datetimes.py", line 382, in to_datetime
    result = _convert_listlike(np.array([arg]), box, format)[0]
  File "C:\Users\asros\Anaconda3\envs\py2\lib\site-packages\pandas\core\tools\datetimes.py", line 306, in _convert_listlike
    raise e
ValueError: time data 'Monday, January 22, 2018 at 6:4' does not match format '%A, %B %d, %Y at %H:%M%p' (match)

Text:

</style><title>Conversation with Friend Name</title></head><body><a href="html/messages.htm">Back</a><br /><br /><div class="thread"><h3>Conversation with Friend Name</h3>Participants: Friend Name<div class="message"><div class="message_header"><span class="user">Friend Name</span><span class="meta">Monday, January 22, 2018 at 6:46pm CST</span></div></div><p>Oh wow</p><div class="message"><div class="message_header"><span class="user">My Name</span><span class="meta">Monday, January 22, 2018 at 6:44pm CST</span></div></div><p>blah blah blah message was here</p><div class="message"><div class="message_header"><span class="user">Friend Name</span><span class="meta">Monday, January 22, 2018 at 6:40pm CST</span></div></div><p>

opened by arosen93 3

WhatsApp parse error

I have exported the WhatsApp chat from a single conversation, and after running python parse.py whatsapp --own-name "firstname lastname" each row in the "senderName" column has a prefix of AM/PM (from the timestamp), e.g. "AM - firstname lastname" or "PM - firstname lastname". This leads to errors in visualization, etc.

Here are a few lines from the WhatsApp export for reproducibility:

9/18/18, 2:45 PM - firstname1 lastname1: I'm sorry! I was talking to people throughout the lunch break! 9/18/18, 4:24 PM - firstname2 lastname2: I'm glad it is going so well! 9/18/18, 4:25 PM - firstname2 lastname2: Here's a quick outtake from the filming 9/18/18, 4:25 PM - firstname2 lastname2: <Media omitted>
bug

opened by cameronweibel 2
No input files found under raw_data/messenger
I followed the steps outlined in the readme.

Downloaded messages from facebook in JSON format. Unzipped and put the content from the messages folder in my raw_data/messenger directory.

When I run python parse.py messenger I get this error:

2020-01-23 04:27:34,164 [INFO ] [parsers.mess]: Parsing Facebook messenger data... 2020-01-23 04:27:34,165 [ERROR] [parsers.mess]: No input files found under raw_data/messenger
opened by KolbySisk 2
Clarify where to export messenger data to, and fix export.py to pkl error
I mistakenly exported my Facebook messenger data content to the messenger folder, instead of the entire messages folder i had unzipped from the Facebook download file. I saw in an other thread, that others had also done this. The readme could be changed so this would be avoided.

If i run export.py -format pkl then i get an error saying:

Traceback (most recent call last): File "export.py", line 59, in <module> main() File "export.py", line 52, in main with open(f_name, 'wb', encoding="utf8") as f: ValueError: binary mode doesn't take an encoding argument

The error can be avoided by not setting encoding in the open() function.
opened by aguldbrandsen 0
When you parse logs from a group chat

2020-09-26 19:38:57,867 [INFO ] [matplotlib.f]: Generating new fontManager, this may take some time... 2020-09-26 19:39:01,947 [INFO ] [parsers.mess]: Parsing Facebook messenger data... 2020-09-26 19:39:01,948 [INFO ] [parsers.mess]: Trying to infer own_name from data... 2020-09-26 19:39:02,037 [INFO ] [parsers.mess]: Successfully inferred own-name to be User1 2020-09-26 19:39:02,061 [INFO ] [parsers.mess]: Group chats are not supported yet. 2020-09-26 19:39:02,079 [INFO ] [parsers.mess]: Group chats are not supported yet. 2020-09-26 19:39:02,100 [INFO ] [parsers.mess]: Group chats are not supported yet. 2020-09-26 19:39:02,109 [INFO ] [parsers.mess]: 0 messages parsed. 2020-09-26 19:39:02,109 [INFO ] [parsers.mess]: Nothing to save.

Can I kindly ask if there will be group chat support in the near future ?

@teamfridge not sure I understand. When you parse logs from a group chat, each message should have its own senderName, which allows you to perform analysis on a per-individual basis. You just have to group/filter the dataframes using this field.

Originally posted by @MasterScrat in https://github.com/MasterScrat/Chatistics/issues/44#issuecomment-577400599

opened by rmdes 1

[Hangouts] KeyError: 'conversations'

Hi,

I tried:

$ python parse.py hangouts -f path/to/my/hangouts.json
2020-07-29 04:48:07,372 [INFO ] [parsers.hang]: Parsing Google Hangouts data...
2020-07-29 04:48:07,372 [INFO ] [parsers.hang]: Reading archive file path/to/my/hangouts.json...
2020-07-29 04:48:09,348 [INFO ] [parsers.hang]: Trying to infer own_name from data...
Traceback (most recent call last):
  File "parse.py", line 84, in <module>
    ArgParse()
  File "parse.py", line 42, in __init__
    getattr(self, args.command)()
  File "parse.py", line 60, in hangouts
    main(args.own_name, args.file_path, args.max)
  File "/path/to/Chatistics/parsers/hangouts.py", line 21, in main
    own_name = infer_own_name(archive)
  File "/path/to/Chatistics/parsers/hangouts.py", line 167, in infer_own_name
    for conversation in archive['conversations']:
KeyError: 'conversations'

Then, I tried:

$ python parse.py hangouts --own-name MARCO -f path/to/my/hangouts.json
2020-07-29 04:49:05,521 [INFO ] [parsers.hang]: Parsing Google Hangouts data...
2020-07-29 04:49:05,521 [INFO ] [parsers.hang]: Reading archive file path/to/my/hangouts.json...
2020-07-29 04:49:07,308 [INFO ] [parsers.hang]: Extracting messages...
Traceback (most recent call last):
  File "parse.py", line 84, in <module>
    ArgParse()
  File "parse.py", line 42, in __init__
    getattr(self, args.command)()
  File "parse.py", line 60, in hangouts
    main(args.own_name, args.file_path, args.max)
  File "/path/to/Chatistics/parsers/hangouts.py", line 22, in main
    data = parse_messages(archive, own_name)
  File "/path/to/Chatistics/parsers/hangouts.py", line 102, in parse_messages
    for conversation in archive['conversations']:
KeyError: 'conversations'

Then I tried to copy the file in ./raw_data/hangouts/, call python parse.py hangouts --own-name MARCO and got the same error.

opened by endersaka 2

Stopwords files for several languages

Hi Florian! Good repo, I had a lot of fun with it :)

I have added stopwords files for some missing languages. They were extracted from here and converted using a Python script.

New languages included: arabic, bulgarian, catalan, czech, danish, dutch, finnish, german, hebrew, hindi, hungarian, indonesian, malaysian, italian, norwegian, polish, portuguese, romanian, russian, slovak, spanish, swedish, turkish, ukrainian and vietnamese

opened by khvilaboa 0

Troubles parsing messenger chats

Hello everyone!

I start saying I'm very inexperienced with... well... everything.

While parsing from WhatsApp works smoothly, I'm having some troubles with messenger chats.

This is the output I'm having:

2020-04-25 23:31:17,481 [INFO ] [parsers.mess]: Parsing Facebook messenger data...
Traceback (most recent call last):
  File "parse.py", line 84, in <module>
    ArgParse()
  File "parse.py", line 42, in __init__
    getattr(self, args.command)()
  File "parse.py", line 69, in messenger
    main(args.own_name, args.file_path, args.max)
  File "/Users/*****/Chatistics/parsers/messenger.py", line 22, in main
    data = parse_messages(file_path, own_name)
  File "/Users/*****/Chatistics/parsers/messenger.py", line 64, in parse_messages
    content = fix_text_encoding(content)
  File "/Users/*****/Chatistics/parsers/messenger.py", line 79, in fix_text_encoding
    return text.encode('latin1').decode('utf8')
UnicodeEncodeError: 'latin-1' codec can't encode character '\u201c' in position 17: ordinal not in range(256)

The output from export then prints:

...
2020-04-26 00:43:32,913 [INFO ] [utils       ]: Could not find any data for platform messenger

In /raw_data/messenger I have all the conversations folders that were previously contained in the inbox folder in the original archive downloaded from Facebook. Like this: /raw_data/messenger/username/message_1.json. (I think the documentation should be updated, it makes it look like that the inbox folder needs to be there too).

I'm working in Jupiter lab.

Thanks in advance!

opened by Goretx 0

Owner

Florian

🤖 Machine Learning

GitHub https://masterscrat.github.io/Chatistics/

Python utility to extract differences between two pandas dataframes.

8 Jan 7, 2023

A tool to compare differences between dataframes and create a differences report in Excel

similarpanda A module to check for differences between pandas Dataframes, and generate a report in Excel format. This is helpful in a workplace settin

9 Sep 15, 2022

An extension to pandas dataframes describe function.

pandas_summary An extension to pandas dataframes describe function. The module contains DataFrameSummary object that extend describe() with: propertie

450 Dec 30, 2022

Python scripts aim to use a Random Forest machine learning algorithm to predict the water affinity of Metal-Organic Frameworks

The following Python scripts aim to use a Random Forest machine learning algorithm to predict the water affinity of Metal-Organic Frameworks (MOFs). The training set is extracted from the Cambridge Structural Database and the CoRE_MOF 2019 dataset.

1 Jan 9, 2022

Catalogue data - A Python Scripts to prepare catalogue data

catalogue_data Scripts to prepare catalogue data. Setup Clone this repo. Install

3 Mar 3, 2022

Analysis scripts for QG equations

qg-edgeofchaos Analysis scripts for QG equations FIle/Folder Structure eigensolvers.py - Spectral and finite-difference solvers for Rossby wave eigenf

2 Sep 27, 2022

Created covid data pipeline using PySpark and MySQL that collected data stream from API and do some processing and store it into MYSQL database.

2 Nov 20, 2021

A Python module for clustering creators of social media content into networks

sm_content_clustering A Python module for clustering creators of social media content into networks. Currently supports identifying potential networks

72 Dec 30, 2022

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format.

2 Dec 1, 2021

Import, connect and transform data into Excel

xlwings_query Import, connect and transform data into Excel. Description The concept is to apply data transformations to a main query object. When the

1 Jan 19, 2022

pipeline for migrating lichess data into postgresql

How Long Does It Take Ordinary People To "Get Good" At Chess? TL;DR: According to 5.5 years of data from 2.3 million players and 450 million games, mo

182 Nov 11, 2022

Package for decomposing EMG signals into motor unit firings, as used in Formento et al 2021.

EMGDecomp Package for decomposing EMG signals into motor unit firings, created for Formento et al 2021. Based heavily on Negro et al, 2016. Supports G

13 Nov 1, 2022

For making Tagtog annotation into csv dataset

tagtog_relation_extraction for making Tagtog annotation into csv dataset How to Use On Tagtog 1. Go to Project > Downloads 2. Download all documents,

4 Dec 28, 2021

Demonstrate a Dataflow pipeline that saves data from an API into BigQuery table

Overview dataflow-mvp provides a basic example pipeline that pulls data from an API and writes it to a BigQuery table using GCP's Dataflow (i.e., Apac

1 Dec 3, 2021

Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoothers, and more. Has companion book 'Kalman and Bayesian Filters in Python'.

FilterPy - Kalman filters and other optimal and non-optimal estimation filters in Python. NOTE: Imminent drop of support of Python 2.7, 3.4. See secti

2.5k Dec 30, 2022

A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).

This tutorial's purpose is to introduce Pythonistas to methods for scaling their data science and machine learning work to larger datasets and larger models, using the tools and APIs they know and love from the PyData stack (such as numpy, pandas, and scikit-learn).

102 Nov 10, 2022

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

pgmpy pgmpy is a python library for working with Probabilistic Graphical Models. Documentation and list of algorithms supported is at our official sit

2.2k Dec 25, 2022

🧪 Panel-Chemistry - exploratory data analysis and build powerful data and viz tools within the domain of Chemistry using Python and HoloViz Panel.

???? ??. The purpose of the panel-chemistry project is to make it really easy for you to do DATA ANALYSIS and build powerful DATA AND VIZ APPLICATIONS within the domain of Chemistry using using Python and HoloViz Panel.

97 Dec 8, 2022

Example Of Splunk Search Query With Python And Splunk Python SDK

SSQAuto (Splunk Search Query Automation) Example Of Splunk Search Query With Python And Splunk Python SDK installation: ➜ ~ git clone https://github.c

1 Nov 14, 2021

💬 Python scripts to parse Messenger, Hangouts, WhatsApp and Telegram chat logs into DataFrames.

Related tags

Overview

Chatistics

Changelog

Support Matrix

Exported data

Exporting your chat logs

1. Download your chat logs

Google Hangouts

Facebook Messenger

WhatsApp

Telegram

2. Setup Chatistics

Telegram

3. All done! Play with your data

Export

Histograms

Word Cloud

Development

Improvement ideas

Social medias

Projects using Chatistics

Credits

Comments

Rationale

Important

Owner

Florian

Python utility to extract differences between two pandas dataframes.

A tool to compare differences between dataframes and create a differences report in Excel

An extension to pandas dataframes describe function.

Python scripts aim to use a Random Forest machine learning algorithm to predict the water affinity of Metal-Organic Frameworks

Catalogue data - A Python Scripts to prepare catalogue data

Analysis scripts for QG equations

Created covid data pipeline using PySpark and MySQL that collected data stream from API and do some processing and store it into MYSQL database.

A Python module for clustering creators of social media content into networks

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Import, connect and transform data into Excel

pipeline for migrating lichess data into postgresql

Package for decomposing EMG signals into motor unit firings, as used in Formento et al 2021.

For making Tagtog annotation into csv dataset

Demonstrate a Dataflow pipeline that saves data from an API into BigQuery table

Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoothers, and more. Has companion book 'Kalman and Bayesian Filters in Python'.

A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

🧪 Panel-Chemistry - exploratory data analysis and build powerful data and viz tools within the domain of Chemistry using Python and HoloViz Panel.

Example Of Splunk Search Query With Python And Splunk Python SDK