Archive for the ‘Geo Technologies’ Category

Correcting Earth’s Parentage

Friday, March 5th, 2010

The letter “E” in WOEID stands for Earth, as in the planet we live upon, not the stuff we walk upon. This means that, for now at least, WOEIDs are strictly Terran in scope, though that hasn’t stopped people requesting we cover other planetary bodies in our solar system including the Moon and Mars.

For now, the Earth, represented by WOEID 1, is as far up the place hierarchy as you can go and the parent of WOEID 1 is a null or empty WOEID. The GeoPlanet web service responds to an attempt to get the parent of WOEID 1 as an invalid request and the GeoPlanet Data download has -1 as the parent of WOEID 1.

So far, so good.

When we initially released GeoPlanet Data, we designed it to be used as a form of data escrow, preserving people’s investment in tagging with WOEIDs. Things never work out precisely as you’ve planned though and we’ve noticed that a lot of people are taking GeoPlanet Data and loading it up into a database. That means that that -1 value as the parent of WOEID 1 is forcing you to use a signed data type to hold a WOEID. We think you should be able to use an unsigned data type for this purpose and so from the next release of GeoPlanet Data you’ll find that the parent of WOEID 1 will be 0 and not -1.

Please bear this in mind when you come to do other wonderful things with our geo-data download.

Gary Gale, Director of Engineering, Yahoo! Geo Technologies

Photo credit: IronRodArt on Flickr.

Visualising and Playing with the Data

Friday, February 26th, 2010

One of the great things about the data that sits behind GeoPlanet is just how rich and varied it is and how that richness can be easily exposed and employed with a few lines of code, either from the GeoPlanet API itself or from the geo table of YQL.

But it’s a dual edged sword …

One of the not so great things about the data that sits behind GeoPlanet is despite the fact that it’s rich and varied you need to be able to write code in order to get at it …

I was recently talking about this to YDN evangelist extraordinaire Chris Heilman who took it as a personal challenge. Less than 24 hours later the GeoPlanet Explorer was born, built entirely from PHP, YQL, YUI and GeoPlanet. Chris’s screencast shows just how easy it is and just how much rich geographic data there is.

But don’t take our word for it … go and find out for yourselves at http://isithackday.com/geoplanet-explorer/index.php?woeid=44417

Gary Gale, Director of Engineering, Yahoo! Geo Technologies

Disputed Places – “de facto” vs. “de jure”

Friday, February 19th, 2010

We recently released the latest version of our geo-data to the world at large, both via the GeoPlanet API and via GeoPlanetData. Hidden away inside the data was a real world example of the part of the GeoPlanet documentation which talks about Naming and Representation of places:

“Yahoo! aims to capture the geography of the Earth as it is used by the world’s people. To this end we are guided by various standards and sources of geographical information. For country codes and names, we rely on ISO 3166-1 but make no specific claim as to official designation or authority of disputed territories.

We appreciate that the subjective and personal nature of world geography ensures that there is no single authoritative hierarchy and we do not aim to impose one here. Rather, the GeoPlanet hierarchy is presented to facilitate geographic discovery, and ultimately assist in disambiguating identically named places, and resolving spatial appellations to a unique, open, and permanent identifier.”


Which is a nice way of saying that sometimes not everyone agrees on the naming and definition of a place. So how best to handle situations which involve a disputed place? Let’s use Cyprus as an example because inside the latest set of geo-data are some tweaks to how we represent Cyprus.

The Embassy of Cyprus got in touch with us because they were concerned with the way in which Cyprus was being represented on Yahoo! Weather. Specifically, that the island was being described as comprised of the Greek Cypriot Republic and the TRNC; the Embassy was unhappy that we were referencing the Turkish Republic of North Cyprus. So we looked into it.

It’s at this point that the de facto and de jure of this post’s title became apparent. The TRNC does exist as a de facto entity and as such it belongs in any geographical gazetteer that attempts, as we do, to capture the world’s geography as it’s used by the world’s people, although we are making no claim as to the legality, official designation or authority of the place.

But we also accept that the TRNC is not recognised as a de jure geographical entity and the previous classification of the TRNC as an administrative unit disagreed with international standards, including ISO 3166-1. So here’s what we did.

  • The TRNC became an informal region and its boundaries were checked to ensure it contains only the places that it should contain.
  • We added Northern Cyprus and North Cyprus as place name variants, with Northern Cyprus as the preferred label and/or name form.
  • The previous Greek Cypriot Republic WOEID was deleted and mapped to the WOEID of the country, together with any alternative names such as Rep. Cyprus and Republic of Cyprus.
  • The district entities previously at Admin Level 2 moved up to become Admin Level 1, in line with ISO 3166-2.
  • We checked that all other name forms, types and language were consistent with local understanding and usage.

That’s a lot of work to be undertaken, but it’s work that’s necessary and essential in order to keep our geo-data and geo products up to date, relevant and global. Next time you use GeoPlanet or one of our other geo offerings, offer up a thought to the Geo Technologies Data team who deal with this sort of data acrobatics on a regular basis.

Gary Gale, Director of Engineering, Yahoo! Geo Technologies


Photo credit: Eric Magnuson on Flickr.