Creating a Bubble Catalogue
In recent weeks, I’ve spent much of my time figuring out how to use all of your drawings to determine where the bubbles are in the Spitzer data. About a month ago we had a breakthrough. Thanks to a lengthy conversation with MWP science guru Matthew Povich, I realised that one of the reasons it is so hard to determine where a bubble should be drawn is that sometimes there is no right answer! There are many bubbles in the MWP that people would disagree on how to draw – the reason is that there is often not necessarily a right answer to the question “where is the bubble?”.
An example of just such a bubble is shown below, with all user drawings shown next to it. You can see that this bubble just isn’t that easy to draw and that there are even two or three structures within the image that one could call a bubble. Instead of trying to make this fit a rigid one-bubble definition, we realised that we should be using the human ability to recognise patterns. After all – this is exactly what you are all so good at, and computers are sometimes not.
Myself and Matthew decided that what we should do in these instances is simply allow two (or even three) bubbles to be deemed as ‘real’. The inner, red structure is a kind of bubble, and so is the open-ended green bubble just outside of it. One could also perceive a third bubble just below and to the left of these, and many people appear to have drawn just that. (This is in addition the multitude of smaller bubbles around the edge, of course). Whatever catalogue is produced by our data reduction, it probably should include at least the first two structures if enough people drawn them.
This decision has made creating a cleaned bubble catalogue much easier. The data reduction process described in my February blog post is still the process I’m using, although it has been greatly refined. More importantly, since February an enormous number of new bubbles have been drawn and this means the averaging process produces better results. Below you can see some results of the latest efforts and hopefully you’ll agree that what is being produced is a good catalogue, based on what you have all drawn. For the sake of testing, I am using one 3-by-2 degree section of the data. This is the region +12 degrees from the galactic centre and contains several interesting and complex features – which makes it a good testing ground.
Below you can see the 3×2 degree tile on its own, with all of your 7,000+ bubbles drawn on top and with the resultant ‘cleaned’ bubbles as well. You can click on any of the images to see the full version.
I have also been looking into other techniques for extracting the bubbles as the crowd sees them. Below you can see just the raw bubble data, drawn by users for this tile. With the background removed, we can use a simple contrast ratio to create a threshold, which we use to cut-out the bubbles from the original image.
This is another method for extracting data, and although it is harder to define a rigid catalogue of bubbles using this method, it may still have use in mapping regions of star formation in our galaxy.
Talk Updates
Our two new community collaboration websites, Milky Way Talk and Planet Hunters Talk, had some updates this week. We thought it was worth going over them in this blog post. We’ve had a lot of feedback about Talk and are working to implement the most-requested features.
The biggest difference you’ll see when logging into Talk is that your discussions are now easier to manage and track. A new, large box on the main page shows all the new and updated discussions since your last login. You can refine these using the two drop-down boxes at the top of this section. You can chose to show discussions from the last 24 hours, the last week, or since any date using a pop-up calendar. You can also chose to only see discussions that you are a part of, which should help you keep track of your conversations.
In addition to these changes, you’ll also find a lot more metadata around the discussions, telling you who last posted, how many people are taking part, and who started the discussion, where relevant. Users within these discussions are now highlighted if they are part of the development team or the science team. This is something a lot of you asked for.
The other item that has been changed with this Talk update is pagination. There are now easy-to-use buttons on the discussions, collections and objects on the front page. These mean that you can browse back through time and see more than just the most recent items. As Talk has grown more popular, this feature has become more necessary.
Another change to the front page is that we now show the most-recent items by default, and not the trending items. You can still see the trending items by clicking the link at the top. Users told us they preferred to see recent activity initially so we made the change. Similarly, the ‘trending keywords’ list now appears on the front page at all times.
Finally, page titles are now meaningful. This means that if you bookmark or share a link, you’ll remember why. Collections are named and objects will be title dusing their Zooniverse ID (e.g. AMW….). Several of you have also noted our lack of a favicon (the little icon next to the URL in your browser bar). This is coming shortly as well.
There are more changes planned for Talk, but these significant updates to the front page were worth noting on the blog. For example, we plan to start integrating social media links into the Talk sites, along with more updates as time goes by. Talk continues to evolve and we welcome feedback at team@milkywayproject.org.
The Milky Way Project
So after adding in a third entry a couple of days ago, it rapidly ran ahead of the pack on the last day of voting. We had more votes on the final day than in all the time leading up to the decision. But we have a name: The Milky Way Project. Stellar Zoo was a close second, and both beat Milky Way Zoo by some way.
Over the next few days, this blog will change from ‘Project IX’ to ‘The Milky Way Project’. We have a new twitter feed @milkywayproj and eventually the URL for this blog will also change. I’ll give plenty of warning about that.
The following two weeks involve a big code push here in Oxford, to try and create a beta site for you to try out. There will be more updates soon with a sample of the first science interface on its way…
[Image credit: NASA/JPL-Caltech]































