Monday, November 8, 2010

Updating the Python Job Board

Have you ever looked at the Python.org Job Board? It lists in chronological order postings sent by recruiters and employers looking for developers with skills and interests in Python.

The Job Board has been around since at least 1997 (I checked using the WayBack Machine just to see what it looked like) and not too much has changed since then. New postings are sent by email and a volunteer cuts and pastes them into the head of a text file, fixes the formatting and checks it into Subversion. Instead of the original HTML template, now an ReStructuredText template is used which causes all sorts of problems because few people know what ReST is or have any idea how to format their posting. Eventually, the posting (if accepted) turns up on the web page and later in the RSS feed and on Twitter.

Here are some stats: Currently, there are approximately 40 jobs posted per month. The web page is in the top 10 entry pages for the site month after month gathering around 80k hits per month. The RSS feed gets around twice that and some third party sites re-brand the content and redistribute it which we have no stats for. The Twitter page has around 230+ followers.

Everything on the page is static and the Python infrastructure team has been against dynamic content for good reason but in a time when Google's appengine allows the the easy construction of a dynamic website with additional server capacity on demand, maybe it is time to update the Jobs Board. Two things to change are the process by which jobs are added and the lack of useful search or visualization ("where are the jobs?", "where are the job-seekers?").

The largest part of the work is converting the incoming posting into valid ReST and checking the page after pasting the new posting in. That's it. Emails are answered with a templated response (for both additions and removals) and checking into Subversion takes seconds.

There is good progress being made on a web front-end that will allow employers to directly submit postings and see how they look in their final HTML form. These will be held pending approval. The use of the browser will allow postings to be added and updated from anywhere instead of only from a machine that has the development environment currently required. There is also some intent to support submission by email but that has details yet to be worked out.

All this will be ready for some friendly testing soon (mid-November) and hopefully will reduce the response time and the amount of work needed to add postings as well as adding feedback to employers about how their ad is performing.

If you have comments or suggestions or would like to help, please let me know!

one of those days..

I have not written anything in here for a while because:

1) I'm too busy
2) I don't have anything interesting to say
3) No-one reads this, right?

A funny thing happened today.. I found this post from Steve Yegge that brushed aside my dejected writer's block and reminded me why I started blogging in the first place - partly to help me remember interesting things and partly to improve my writing skills. Or lack of.

If you do read this, leave me a comment.

Monday, July 12, 2010

Moving House

Not literally moving house but I decided it was time to upgrade my python.org local build machine from Redhat ES4 to something more recent so I fired up a Fedora Core 11 VM that I haven't been using so far, copied over my SSH keys and started checking out the website svn repository.

There was nothing wrong particularly with ES4 but Xemacs is a little long in the tooth now and didn't want to mess the machine config around to install Komodo or Eclipse when that VM was really for testing releases at work.

The checkout finished without a hitch (can't get PEPs but that was expected). Now to install docutils, yaml and pyramid.

I am using an RPM-based system so I have a choice of installing the packages from source, installing from easy_install or RPM. My preference is RPM if only to make sure that I have a single place cataloging all software installed on system paths. If I wanted to get into virtualenv or some other sandbox arrangement (not today) then I could go hog-wild and install all sorts of crazy stuff.

I install docutils and run make again and it fails looking for yaml. There's only one YAML available from Yum that is Python related but is it the one that pydotorg is using? A quick search of the wiki and a broken link or two later and I decide to give it a go.

The README in the checkout root directory tells me I need mako (a templating library) as well. Yum adds in python-beaker (a caching library or WSGI middleware layer) and libyaml and away we go.

The next make command succeeds so start the server and presto! Everything is working, time to shut down the ES4 image.

Start to finish.. around 15 minutes. Lots faster than the first time or two when the site went to using Pyramid.

Saturday, June 26, 2010

Python Job Board Tag Cloud

Using TagCrowd (http://www.tagcrowd.com), I created this visualisation of the most common words on the Python Job Board (after removing meaningless, high-frequency words like contact, description, e-mail etc).



created at TagCrowd.com

Wednesday, January 27, 2010

Using Git Behind a Firewall

I am doing a lot of work involving the Samba code base right now and it has frustrated me that I cannot check out the trunk using git because I am behind a firewall.. then again the tarballs for the stable releases have been adequate until I realised that my tinker toy modifications needed to be made into patches and that was tricky..

Long story short:
if you try 'git clone git://git.samba.org/samba.git samba' and it fails because of a firewall issue then change the scheme from git:// to http:// and presto!

I think this only works if the repository you are checking out has a gitweb installation.

Tuesday, July 21, 2009

Python in a web page

"Gestalt is a library released by MIX Online Labs that allows you to write Ruby, Python & XAML code in your (X)HTML pages. It enables you to build richer and more powerful web applications by marrying the benefits of expressive languages, modern compilers, AJAX & RIAs with the write » save » refresh development model of the web."

Python on a web page! In Silverlight .. which means it is time to learn Mono and .Net and all that stuff.

Update: I did nothing to learn any of that stuff then one day I walked into the office and they said "oh, we need you to work on a C# project.. on a Windows box"

Thursday, June 18, 2009

SQLite Gotcha 2

Creating a table with a column of type INT PRIMARY KEY will not result in automatic assignment of a unique value into that column when new rows are added. Why not? In part because INT is not a type in SQLite (but you can use it and the column will have integer affinity but you have to provide the value) and partly because rowid assignment only occurs on columns defined as INTEGER PRIMARY KEY.

Take a look at the documentation on SQLite data types (http://www.sqlite.org/datatype3.html). You may also be surprised to discover that you can declare a column of one type then store data of a different type in it.

Sunday, June 7, 2009

SQLite Gotcha

ROWID is a reserved word in SQLite which seems reasonable.. but it has a synonym OID so if you declare a table with a column called OID then select from that table without using the table name as a prefix, you will not get what you expect.

If you are using an ORM to abstract away the database then you may never realise what is going on..

Unicode Gotcha

What will this give you?

>>> astring = None
>>> print unicode(astring)

If you said None then you will be surprised to find the answer is u"None" which is not at all the same thing. Really messed up my day..

Monday, June 1, 2009

Bing Sucks

Bing, the new search engine from Microsoft, sucks. This conclusion is based on one simple test:
Enter "internet search" as your search term and the results do not contain Yahoo, Google, Bing or LiveSearch. Dogpile is the number one hit.

Remove the word internet and now you get Yahoo listed top.

Try the same test on Yahoo and it lists Bing first in a sponsored results box, followed by Yahoo and Google .. and then Dogpile. Huh?