Create a virtual environment, then run `pip install -r requirements.txt` to install the project dependencies.
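For example (a minimal sketch, assuming Python 3 on a Unix-like shell; the environment name `venv` is an arbitrary choice):

```bash
# Create and activate a virtual environment
# (on Windows, use venv\Scripts\activate instead of source).
python -m venv venv
source venv/bin/activate

# Install the project dependencies
pip install -r requirements.txt
```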
The following folder structure needs to be created (a shell sketch for creating it in one command follows the list):

- `data/`
  - `no1_original` - the full, original NELA-GT-2019 dataset (csv)
  - `no2_original_split/` - the split dataset with the original features (csv)
    - `fake_split`
    - `real_split`
  - `no3_all_features_split/` - the split dataset with all features except word embeddings (pkl)
    - `fake_split`
    - `real_split`
  - `no4_embeddings_split/` - the split dataset with only word embeddings (pkl)
    - `fake_split`
    - `real_split`
  - `no5_embeddings` - the full dataset with only word embeddings (pkl)
  - `no6_numerical` - the full dataset with numerical features and true labels (pkl and csv)
  - `testset` - the cleaned test set (csv)
  - `weak_labeling/` - scores for the Snorkel weak labeling systems
    - `analysis`
    - `confusion_matrix`
  - `describe` - dataset description, histograms, and boxplots
  - `sources` - the sources from NELA-GT-2019
  - `snuba/` - scores for the Snuba weak labeling system
    - `goal`
    - `result`
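A minimal sketch for creating the whole tree in one command, assuming bash (or another shell with brace expansion):

```bash
# Create the expected data/ directory tree; -p also creates parent
# folders and is a no-op for folders that already exist.
mkdir -p data/no1_original \
         data/no2_original_split/{fake_split,real_split} \
         data/no3_all_features_split/{fake_split,real_split} \
         data/no4_embeddings_split/{fake_split,real_split} \
         data/no5_embeddings \
         data/no6_numerical \
         data/testset \
         data/weak_labeling/{analysis,confusion_matrix} \
         data/describe \
         data/sources \
         data/snuba/{goal,result}
```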