Skip to content

bernardbeckerman/spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spark - open-ended tracking of user behavior on stackoverflow

• Established relation between [lxml, pyspark]
    ◦ favorites and up/down vote ratio
◦ user reputation and post type ratio (question vs. answer) 
◦ user reputation and number of posts
◦ time of day and waiting time for an answer
◦ first post response quality and site tenure
• Developed synonym finder [word2vec]
• Predicted question tags from body text [pyspark.ml, regex]

About

Analysis of wikipedia dataset using spark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 106