Showing posts with label google reader. Show all posts
Showing posts with label google reader. Show all posts

Saturday, 19 November 2011

Data driven journalism

“Journalism has always been about reporting facts and assertions and making sense of world affairs. No news there. But as we move further into the 21st century, we will have to increasingly rely on “data” to feed our stories, to the point that “data-driven reporting” becomes second nature to journalists.”
Zach Beauvais

The above statement closely resonates with the work of American journalist and Freedom of Information activist,  Heather Brooke, who helped to expose UK parliamentary expenses. However, as a trainee journalist, the last thing that comes to my mind is data. Statistics are not my forte. I was terrible at maths in school and to be fair, I have managed to get through life knowing the basic - addition, subtraction, division and multiplication. So, imagine my face when, in digital journalism class, there was talk of mean, median and mode. Huh? I never thought I’d have to do averages again. Ever. 

But, my lecturer Andy Dickinson aka digidickinson put things into perspective when he said: “The mark of a good journalist is not knowing how to do everything, but, knowing whom to ask.” This statement alone was enough to get my attention.

In 3 hours, I learned that “data” can in fact provide interesting stories and is not as hard as it seems to analyse. The class began with looking at how to create forms and spreadsheets on Google documents and how these can be useful to obtain, arrange and visually display information or results. Andy taught us how to ‘scrape’ and ‘clean’ using html formulas.

Scraping

This is a way to get information out of web pages and in to a spreadsheet. Google docs also have a clever way to ‘scrape’ websites as part of their spreadsheet tool. It involves a simple copy and paste using a, not so simple, formula and wallah the data magically appears, from your chosen web page and in to your spreadsheet. Thankfully for me, I wrote the instructions down.

As a class we copied the table of results of the best-selling albums, off wikipedia (for practice purpose only) and transferred this information into our Google spreadsheet. Once the data is in the spreadsheet, you are allowed to ‘clean’ it in terms of getting rid of any errors. Depending on what sort of data you are dealing with, the super helpful formulas (and this I did find impressive) will help you find the average, or in mathematical terms, the mean, median and mode of your selected cells within that one spreadsheet - you just have to know the right formula to tell Google spreadsheet what to do.

It really is something that, I see, can help journalism, also, the formulas can easily be found on the internet so you don’t actually have to do any maths! 

The “daddy of data scraping”

Funded by Channel 4, Scraperwiki is a website predominantly for computer programmers to help data requesters and, right now, is the current fave among journalists as the go to place for data related information. As we discussed in the lecture, the relationship between journalists and programmers is important. In many cases it helps to serve the public interest, especially if the data in question can reveal valuable and newsworthy material, just as the analysis of MPs expenses did.

Meet Junar was another website that Andy introduced us to. Not only can you collect, organise and use data but it also contains data that you can explore and which might be of use to you. It also has a social element by allowing the user to share their data on social networking platforms. Now, this is more my cup of tea. 

The Guardian's DATABLOG, edited by Simon Rogers, a pioneer in data driven journalism, is a great place for inspiration when it comes to how to report stories based on data and more importantly how to present the data in visual form. More recently, the DATABLOG, published a world map dotted with the occupy protests, which on it's own painted a powerful story. It goes to show that data can be used in so many different ways.

Visualisation

How do you present data in an engaging yet simple form? This part I didn’t mind. Bar charts, pie charts, line graphs and tables – they all work well with stories based on facts and figures. This sort of traditional visualisation of data goes hand in hand with stories about growth, decline, change, comparison and ranking. However, if you want to opt for a more simplistic yet effective way of telling a story then take a look at Wordle.net – a toy for generating “word clouds” from text that you provide. The clouds give greater prominence to words that appear more frequently in the source text.

wordle.net is free and easy to use                  Zoom in
Check out the word cloud above which I created on Wordle.net using a story, headlined, Hamid Karzai tells loya jirga: no US military pact until night raids cease, from the Guardian's website

The most prominent words in the news report are Afghan, Karzai, military, night, raids, partnership, sovereignty, national, Afghanistan, strategic and operations. The word cloud instantly portrays the angle and of this story and you now have a good idea of what's written in the full report. 

By the end of the class, I felt much more comfortable about having to analyse and organise data for journalistic use.

Sunday, 2 October 2011

Digital journalism tools

Every day I trawl through dozens of sites to get the latest news, travel tips and music reviews. For as long as I can remember, I have always noticed the little orange box in the corner of a web page yet steered well away from it with the firm belief that RSS feeds are too technical for my liking.

It turns out that I could be saving myself a lot time if I was brave enough to explore the mystery behind the little orange boxes. As our own digital guru, Andy Dickinson, a.k.a digidickinson, explains, RSS feeds are in fact simple to use and wonderfully convenient at saving you time. In a 3-hour digital journalism class he introduces us to Google Reader. It allows you to subscribe to numerous RSS feeds, i.e. the little orange boxes on a website, from where you must copy the RSS link and paste it into your Google Reader profile. This means that you now get all the information you want to receive in one place as Google Reader filters information from your favourite websites, updating you with the latest content from each, based on the order it is published.

What’s even more useful is that you can group your feeds into appropriate folders to optimise your browsing in one feed. For example, I have a ‘Travel News’ feed where I have subscribed via RSS to National Geographic Traveller and Guardian Travel. As my beat for my practical journalism module is environment, I also subscribed to the environment sections of the BBC and Preston’s local paper, the Lancashire Evening Post. The feeds from these sites are grouped under ‘Environment News’. So now, instead of visiting dozens of websites, I just log into Google Reader and the content from my subscriptions is there, ready for me to browse and read, and even star articles that I wish to come back to later. I’ll be clicking the little orange boxes more, now that I have discovered Google Reader.


Diagram shows how RSS serves as the mechanism for syndication and aggregation of RSS-enabled Web content - meaning that RSS feeds are useful for online users and beneficial to publishers too. 
Illustration by Jason Rhodes
Digidickinson then moved onto Delicious. The basis of this site is to collect links that you find worth sharing. The site offers elements of social networking whereby users are able to search, view and share each other’s lists of links. These lists can include the most unusual content such as ‘Zombie Apocalypse Survival Guide’ to more special interest links such as ‘Mosaic Sculpture Gardens’. Once more, this site is a useful time saver. You can search any topic and a pre-made list of links will present itself to you. Furthermore, if the links are on Delicious, then no doubt, an online user thought they were worth sharing and could prove more useful than 750 pages of Google or Bing results.

Then, we go onto Trunk.ly, a social bookmarking tool that fulfills the users needs for ‘data curation’. Its homepage reads –‘…the easiest way to save links online’ – and that is exactly what it is. Great for storing links to things on the web you may want to return to later – and especially great – because you can access these links from any computer, unlike a bookmark, which is specifically saved to your own laptop. Trunk.ly also remembers any links that you share on your online platforms, whether that be on Twitter, Facebook, your blog or even Google Reader. All your links are stored here so you don’t have to trawl through your internet history to find them again. 

Enter Twitterfeed – an online (re)distribution tool that acts as a self promotion platform. Twitterfeed will automatically tweet new posts that you publish onto your blog. So, once I’ve written this, my humble following of 600+ tweeps will see a link to this very post on their timeline. The most exciting online platform that our digital mastermind introduced us to, is something he uses to inform his students of his whereabouts – incase they struggle to cope with him not being in his office. He does this through ifttt (if this then that). The site automates feeds based on trigger events. These commands can trigger by dates and times and link to other social network platforms. For example, digidickinson will create a trigger that on Thursday afternoons between 12-3, his twitter page @wheresandy will post a tweet that reads, ‘I’m teaching postgrads til 3pm’. Obviously, very useful for the long queue of undergrads waiting to see him!