codebase
open-source computational social science tools for students and scholars
Hello, these scripts are for everyone, but they're designed for people who want to explore their (social science) data and are interested in programming (in R). Everything I do here you can do as well, for free, on a reasonably modern computer. I'll try to illustrate my code with interesting real data. You can choose to read or code-along.
Recommended guides to complement the content on this page:
Google Colab Notebooks
These notebooks are interactive chunks of annotated code along with their output. You could run these on a smartphone if you like programming with your thumbs.
You will need a google account to copy and modify code directly in the browser. The Gemini AI assistant (available on the top right corner of each notebook) can help explain bits of the code you'd like to know more about, and help you adapt it to your data and needs.
Access data from a Google Sheet. Conduct a basic content analysis of a text corpus. Find most frequent words, compare across categories and over time.
Access data from Google Drive. Make sense of a large corpus of texts from computational summaries based on an LDA topic model. Use topic models for purposive sampling.
Create a co-author network from a SCOPUS database export.
Randomly assign order of presentations or anonymize student names. Curve grades based on target mean and median.
Rmarkdown Notebooks
Programming tutorials: Annotated chunks of R code and the output of that code. These programs require a bit more juice to run, so they're best run on your laptop or desktop. To code along, please install R and RStudio on your machine, in that order.
Requirements: An internet connection and a Twitter developer account with an approved Academic Track project.
Applications: Analyzing and representing changes in texts over time
Applications: Inductively identifying key themes, identifying "fingerprints" of authors or publications in a large longitudinal text data. Measuring and representing similarity/dissimilarity/change over time (Good starting point for Lit Reviews).
Coming soon...
Comparing the performance of LDA and BERT topic models
Creating a co-citation network from data exported from Web of Science/SCOPUS
Reference
To cite this page for code or research methodology, please use:
Bhardwaj, A. (2024) Codebase: Open-source computational social science tools for students and scholars. https://www.anandb.net/code