Jobmined, Part II (ish)

The first round of JobMine postings closed last night at 11:59 EST, and although this isn't exactly Jobmined data, it still is tangentially related to JobMine.

If you haven't read Lisa Zhang's post yet, I highly recommend you do (or at least have it open for reference). Her blog has a bunch of statistics-related things that make for an interesting read.

Methodology

Due to the way JobMine works, I have made the following assumptions and normalizations to the dataset:

  1. For any listing with multiple locations, only the first location is considered
  2. Postings with "Various" as the location are adjusted with the first Google result of the company's name

Definitions

For our purposes, a job posting is the notice that employers post on JobMine. Each posting can have multiple openings, which is the number of students they'll ideally hire. Now onto the JobMine employer statistics!

Heatmap & Posting Numbers

Here's a heatmap of the weekend's postings, with companies from 4 of the 7 continents offering co-op positions to Waterloo students.

Job Openings Heatmap

You can click on the map for a larger image, but it doesn't really do justice to the sheer volume of postings in Toronto and the Kitchener/Waterloo area.

Here's the top 10 locations with the greatest amount of job openings.

Location Openings
Toronto 1,492
Waterloo 556
Mississauga 315
Kitchener 273
Ottawa 263
San Francisco 127
Guelph 107
Vancouver 102
Markham 99
Chalk River 73

It's no real surprise that Toronto and Waterloo retain their number 1 and 2 positions. What's interesting is that San Francisco—which wasn't even on Lisa's Top 10 list—checks in at position 6.

Also, merely 66 companies contribute to Kitchener's 273 open positions!

Here's the top 10 cities by postings—it's same 10 cities, just in different orders.

Location Postings
Toronto 992
Waterloo 285
Mississauga 195
Ottawa 185
Kitchener 137
San Francisco 88
Vancouver 81
Markham 68
Chalk River 64
Guelph 44

If we take the top 10 locations by the number of applicants, we see a similar story.

Location Applications
Toronto 58,117
Waterloo 16,640
San Francisco 9,318
Mississauga 9,171
Kitchener 8,718
Ottawa 5,181
Markham 3,321
Vancouver 3,173
Cupertino 3,030
Palo Alto 2,801

The applications heatmap doesn't really show anything different from the openings map.

Job Applications Heatmap

But if we look at the top cities that get the most applicants per job opening instead, it's a completely different picture.

Location App/Opening
Fremont, CA 217
Venice Beach, CA 174
Redmond, WA 156
Dublin, Ireland 145
Mountain View, CA 144
Tokyo, Japan 132
Picton, ON 114
Cupertino, CA 113
Fergus, ON 95
Sunnyvale, CA 94

If you think about companies that have offices in each city, the numbers start to make a lot of sense.

Cities like Dublin don't offer many job openings, but they are immensely popular among co-op students (most likely because of the foreign location). They are also usually satellite offices for other Silicon-Valley companies, so language isn't that big of an issue, if at all.

In addition, Waterloo students are not limited to these postings—we're also free to apply to jobs outside of JobMine. That's part of the beauty of the co-op system, and shows just how flexible it is.

I think co-op is great, giving you opportunities to travel around and see places that you might not otherwise go. I even know a few people who factor in places they want to visit, when choosing their next co-op placement!

n-grams and Job Titles

An n-gram is simply a continuous sequence of n items from a given sequence.

Unigrams are simply n-grams with n = 1. In this case, we choose to look at words—although we could've chosen to look at individual characters.

I threw the job titles through a tokenizer, removing any stop words in the NLTK stop words list.

Top 50 Unigrams in Job Titles

Not surprisingly, when we consider the top 50 unigrams, "software", "developer" and "engineering" took the top 3 spots.

However, I was a little surprised at how many companies referred to their hires as "co-op" students.

At least in Computer Science, most students want to be treated as full-time employees, and work on things that actually matter (and not shoved into a corner working on something that might not actually make it into production)—so much to a point that various companies will emphasize during their info sessions that co-op students are treated the same way as full-time employees. The designation of being "just a co-op" definitely turns some people away.

I thought I'd also point out that the reason the number "8" shows up fairly often is because some employers add this in the job titles to indicate whether this is a 4 or 8 month posting.

If we look at the bigrams (n = 2), we see something similar for the top two spots.

Top 50 Bigrams in Job Titles

Some interesting titles include "summer student", "supply chain", and the fact that "test engineering" is marginally higher than "quality engineering" and "assurance analyst".

Job titles aren't very long, but we can look at trigrams to look for more patterns. Again, the top 4 results aren't particularly surprising.

Top 50 Trigrams in Job Titles

Note: The numbers in the titles are job posting IDs for various internal applications that companies may prefer candidates use.

You may want to right click on the graphs to view a larger resolution image.

I think it may be worthwhile to look into how postings for various faculties affects the way employers tailor their postings. We'll take a look at that in the next post.