Python and Social Media Data for the Social Sciences

In July I gave two short workshops at the OII’s Summer Doctoral Programme and at the Digital Humanities at Oxford Summer School. I had two great groups of bright PhD students and postdocs to teach. The sessions were only two hours long, and it’s a big challenge to teach meaningful programming skills to complete beginners in so short a time. In the end, I decided to walk them through a small example project, getting news articles from an RSS feed and checking how many times they had been shared on Facebook, and provided most of the code myself. I also rely on a lot of technology which I can’t fully control, which is a risk: I want to teach people to connect to things like the Facebook API, which means I need Python working on their machines, those machines connecting to the internet through the visitor wifi, and the FB API being up and running during class. But the tech worked, mostly, and the overall experience was really positive.
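For anyone curious about the shape of that example project, here is a minimal sketch of it rather than the actual workshop code. It parses an RSS document with Python's standard library and then asks a lookup function for a share count per article. The sample feed, the example.com URLs, and `fake_share_lookup` with its canned numbers are all made up for illustration; in class the lookup would be a call to the Facebook API, which I have left out here.

```python
import xml.etree.ElementTree as ET

# A tiny, made-up RSS 2.0 document standing in for a real news feed.
SAMPLE_RSS = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example news</title>
    <item><title>First story</title><link>http://example.com/a</link></item>
    <item><title>Second story</title><link>http://example.com/b</link></item>
  </channel>
</rss>"""

# Invented share counts, used only so the sketch runs without a network.
FAKE_COUNTS = {"http://example.com/a": 12, "http://example.com/b": 3}


def parse_feed(rss_text):
    """Return the title and link of every <item> in an RSS document."""
    root = ET.fromstring(rss_text)
    articles = []
    for item in root.iter("item"):
        articles.append({
            "title": item.findtext("title", default="").strip(),
            "link": item.findtext("link", default="").strip(),
        })
    return articles


def fake_share_lookup(url):
    """Hypothetical stand-in for a real API call that returns a share count."""
    return FAKE_COUNTS.get(url, 0)


def count_shares(articles, lookup):
    """Map each article link to the share count reported by `lookup`."""
    return {article["link"]: lookup(article["link"]) for article in articles}


if __name__ == "__main__":
    articles = parse_feed(SAMPLE_RSS)
    for link, shares in count_shares(articles, fake_share_lookup).items():
        print(link, shares)
```

Passing the lookup in as a function keeps the feed parsing testable even when the wifi or the API is down, which is exactly the risk described above.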


Looking ahead, however, I strongly believe that social science needs a better way of integrating computer programming skills into undergraduate and postgraduate teaching, so that these doctoral-level workshops can be more about mastering skills and less about training beginners. So I suppose the hope is that in a few years I won’t need to teach such courses any more, even if I do enjoy them.

A “big data” approach to studying parliamentary scrutiny

I have a new article out in the British Journal of Politics and International Relations: In Search of the Politics of Security. In it, I take what could be called a big data approach to the study of parliamentary scrutiny, by scraping information on the passage of legislation from the UK parliament’s website. The website’s current incarnation is relatively recent and not that much legislation is passed every year, so I was only able to scrape information on around 150 successfully passed bills. However, the information which does come out is quite rich – all recorded votes, the amount of time it took to pass the legislation, links to debates and committee hearings, etc. So I still think of it as a kind of big data approach.

My question was pretty simple: does the UK parliament offer less scrutiny on legislation which relates to crime and national security? This emerges from my interest in securitization theory and security politics, which I must admit I have recently been drifting away from slightly (as the war on terror has died down I also think it is becoming slightly less relevant). The project started off as an attempt to measure the scale of this difference, based on what I perceive as a quite widespread assumption that legislators essentially roll over when the government wants to toughen up crime or security law. In the end, however, I found a relationship in the other direction: such legislation seems to get more attention and scrutiny. It’s a smallish dataset and a limited time period, so the conclusions aren’t hard and fast; nevertheless, I think it’s a bit of a challenge to the way security politics is often conceptualised.

Why do MOOC users meet face to face?

Last week Monica Bulger, Cristobal Cobo and I presented a paper at the ICA’s pre-conference on higher education innovation. Monica and Cris are the experts in this area and did most of the heavy lifting, but I was pleased to take part, mainly out of a professional curiosity about how Massive Open Online Courses may or may not be changing the face of higher education. In the paper we looked in particular at patterns of offline meetups amongst the users of these online courses, using data from the Meetup API (my role being to facilitate data gathering and manipulation). Meetup have an open and generous stance on API data, and after a bit of coding I was able to extract information on several thousand face-to-face meetings of students taking part in Coursera courses in over 100 countries around the world.
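Once the meetup records are on disk, getting from "several thousand meetings" to "over 100 countries" is a simple tally. The sketch below assumes each record is a dictionary with a `country` key holding a country code; that layout and the sample records are my own illustration, not the actual format returned by the Meetup API.

```python
from collections import Counter


def meetups_per_country(events):
    """Count events per country, skipping records with no country value."""
    return Counter(
        event["country"] for event in events if event.get("country")
    )


if __name__ == "__main__":
    # Made-up records; the field names are assumptions for this sketch.
    sample_events = [
        {"name": "Study group", "country": "gb"},
        {"name": "Course kickoff", "country": "it"},
        {"name": "Revision session", "country": "gb"},
        {"name": "Unlabelled event"},
    ]
    counts = meetups_per_country(sample_events)
    print(len(counts), "countries")
    for country, n in counts.most_common():
        print(country, n)
```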

[Image: Meetup map]

More clicks on Wordle produced a word cloud of the titles of each meetup, which I can’t resist because it looks so nice, even if it probably isn’t a good way of doing science.

[Image: word cloud of meetup titles]
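Underneath, a word cloud is just a word frequency table with the font size turned up. A rough sketch of the counting step follows; the tokenising rule, the short stopword list, and the sample titles are all my own assumptions, and Wordle applies its own rules which I have not tried to reproduce.

```python
import re
from collections import Counter

# A deliberately small stopword list, chosen only for this illustration.
STOPWORDS = {"a", "an", "and", "the", "of", "for", "in", "to", "on"}


def title_word_counts(titles):
    """Count lower-cased words across all titles, ignoring stopwords."""
    counts = Counter()
    for title in titles:
        words = re.findall(r"[a-z']+", title.lower())
        counts.update(word for word in words if word not in STOPWORDS)
    return counts


if __name__ == "__main__":
    # Made-up meetup titles.
    sample_titles = [
        "Machine Learning study group",
        "Study group for Machine Learning",
        "Intro to Finance meetup",
    ]
    for word, n in title_word_counts(sample_titles).most_common(5):
        print(word, n)
```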

What does it all mean? Beyond showing the impressive worldwide reach of Coursera, and the fact that people like face-to-face interaction when they are learning, we are still deciding, to be honest. Suggestions welcome.

Computational Social Science: Social Contagion, Collective Behaviour, and Networks

I am part of the organising committee of this event, which is part of my growing interest in all things related to sociophysics. The call for abstracts follows:


Computational Social Science: Social Contagion, Collective Behaviour, and Networks
to be held in Lucca, Italy, 24-25 September 2014

Website: http://cssworkshop.oii.ox.ac.uk/

Important Dates:
Abstract submission deadline 22 June 2014
Conference date 24-25 September 2014

Event Overview
Technology-mediated social collectives are taking an important role in the design of social structures. Yet our understanding of the complex mechanisms governing networks and collective behaviour is still deplorably shallow. Fundamental concepts of on- and off-line networks such as power, authority, leader-follower dynamics, consensus emergence, information sharing, conflict, and collaboration are still not well defined and investigated. These are all crucial to illuminate the advantages and pitfalls of collective decision-making, which can cancel out individual mistakes, but also spiral out of control.
In recent endeavours, data from Twitter, Facebook, Google+, Wikipedia, and weblogs have been shown to correlate strongly with, and even predict, elections, opinions, attitudes, movie revenues, and oscillations in the stock market, to cite a few examples. Similar data have provided insights into the mechanisms driving the formation of interest groups, topical communities, and the evolution of social networks. They have also been used to study polarization phenomena in politics, diffusion of information, and the dynamics of collective attention. However, a deeper understanding of these phenomena is still very much in demand. In parallel, and even preceding the surge in interest towards social media, the area of agent-based modeling (ABM) has grown in scope, focus and capability to produce testable hypotheses, going beyond the original goal of explaining macroscopic behaviors from simple interaction rules among stylized agents.
The aim of this satellite is to address the question of ICT-mediated social phenomena emerging over multiple scales, ranging from the interactions of individuals to the emergence of self-organized global movements. We would like to gather researchers from different disciplines and methodological backgrounds to form a forum to discuss ideas, research questions, recent results, and future challenges in this emerging area of research and public interest.

QR codes on ballot papers


I was asked to provide a brief comment on this BBC Oxford article about the insertion of QR codes onto ballot papers by a political party in the south east. A really smart idea (and the party is pretty interesting as well), though also one which challenges something about the way we think politics ought to work: should people still be deciding as they hold the ballot paper in their hand?
