A tool for matching Redditors with similar interests based on their subreddit subscriptions and activity.
Decided to setup docker just in case, can be built/ran with ./run.sh
i fucked up spacing in a file for some reason
- https://anvaka.github.io/sayit
- apparently https://www.reddit.com/r/Serendipity has a bot with similar tactics
- Gensim's word2vec modeling package
- /u/Evthma's association rule mining subreddit dataset
- Pushshift.io for unlimited user data
- collaborative filtering
- priori/association rule learning
- Hausdorff distance of sets
- Jaccard Index
- default subreddit subscriptions
- remove NSFW subreddits (or have the option to?)
- number of subscribers in a sub (smaller subs could be a more special interest)
- activity vs subscriptions
- Discussion with the actual owner of r/submatch is currently ongoing about how to proceed with development.
- Issues will be created tomorrow regarding for anyone's input regarding:
- structure of the repo
- algorithm discussion/planning
- how the tool will be made available to users (webapp? bot?)