It’s rather short notice, unfortunately, but we’ll be holding another meetup on July 16th in Ghent, featuring a very promising talk by renowned data geek Jimmy Lin about Twitter’s big data mining infrastructure. Space is limited, so head to our meetup page straight away to reserve your spot.
The analytics platform at Twitter has experienced tremendous growth over the past few years in terms of size, complexity, number of users, and variety of use cases. This talk will discuss the evolution of the Twitter infrastructure and the development of capabilities for data mining on “big data”. We’ll share experiences as a case study, but also make recommendations for best practices and point out opportunities for future work.
About the speaker
Jimmy Lin is an associate professor in the iSchool at the University of Maryland, with appointments in the Institute for Advanced Computer Studies (UMIACS) and the Department of Computer Science. He works on “big data”, with a particular focus on large-scale distributed algorithms for text processing. His research lies at the intersection of natural language processing (NLP) and information retrieval (IR). Recently, Jimmy spent an extended sabbatical (from 2010 to 2012) at Twitter working on large-scale data analytics. Previously, he has also done work for Cloudera, the enterprise Hadoop company.
Good news everyone!
The schedule for our 9th meetup is complete. We will have three talks from different areas of the big-data universe:
We hope you like this schedule as much as we do, and we hope to see plenty of you there!
The friendly folks of NGDATA in Ghent will host our 9th meetup. Thanks in advance for that!
Besides a location, we are always looking for interesting things to discuss during the meetup. Have you read something interesting in the bigdata/nosql space lately? Are you implementing something amazing right now? Do you have a problem that you want to discuss? Let us know!
Looking forward to hearing from you all!
As our 8th meetup turned into a hackathon on Twitter’s Storm, we devised a small presentation on what our use case was going to be: based on a person’s name, we fetch profile information from various social networks and put the profiles side by side to highlight the differences.
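To make the idea concrete, here is a minimal sketch of the comparison step only. It is not the Storm topology from the hackathon: the `compare_profiles` function, the field names and the sample data are all hypothetical, and no real social network API is called. It simply assumes the profiles have already been fetched into plain dictionaries.

```python
from typing import Dict, Optional

# A profile is a flat mapping of field name to value (hypothetical shape).
Profile = Dict[str, Optional[str]]


def compare_profiles(profiles: Dict[str, Profile]) -> Dict[str, Dict[str, Optional[str]]]:
    """Return, for every field whose value differs between networks,
    the value found on each network (None if the field is missing)."""
    # Collect every field name seen on any network.
    fields = sorted({field for profile in profiles.values() for field in profile})
    differences = {}
    for field in fields:
        values = {network: profile.get(field) for network, profile in profiles.items()}
        # Keep the field only if the networks disagree on it.
        if len(set(values.values())) > 1:
            differences[field] = values
    return differences


# Made-up sample data standing in for fetched profiles.
profiles = {
    "twitter": {"name": "Jane Doe", "location": "Ghent", "bio": "Data geek"},
    "linkedin": {"name": "Jane Doe", "location": "Brussels", "bio": "Data engineer"},
}

for field, values in compare_profiles(profiles).items():
    print(field, values)
```

In a Storm setting, one could imagine the fetching being spread over several bolts and a final bolt doing a comparison like this one, but that split is an assumption rather than a description of what was built.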
The next meetup is already approaching and we are still missing some interesting topics to discuss.
So if you have read something lately that is worth mentioning, or if you’re in the middle of a breakthrough on an interesting brain teaser, or if you are implementing a wonderful project or just doing anything else relevant to our domain, please take a moment to prep some slides and get a discussion going at our 8th meetup!
Looking forward to hearing from you all!
Three weeks ago our little bigdata community had its 7th meetup in Brussels. We think it is a good idea to hold our meetups in different cities, since we are the Belgian bigdata community. (If you can host a meetup in your city, please contact us!) Besides the typical evening traffic chaos and a meeting of all European prime ministers, there was a crime scene (some sort of knife fight) next to our meeting place, which caused some of our participants to arrive a bit later than planned.
Nevertheless, we had a good schedule, consisting of two talks with lots of good interaction between the speakers and the audience.
The first talk was about Storm, a distributed realtime processing framework coming out of Twitter. Daan Gerrits gave an introduction to Storm and walked us through an example application he had created for this meetup.
The second talk (by me) was about Apache Giraph, a graph processing framework on top of Apache Hadoop.
If you have been to one of our meetings and you liked it, please spread the word, leave comments here, and consider the “call for papers” for our 8th meetup in July open!
So it has been a while since we held our first meetup on July 5th, 2011. We had a lively discussion on ideas, wants and won’ts for our young but apparently vibrant community. After some discussion in our group, we prefer to set up our meetups on a schedule rotating over Tuesday, Wednesday and Thursday, at an interval of 6-7 weeks.
So, we’ll be scheduling our 2nd meetup for Wednesday August 24th, 2011. Keep an eye on our meetup page.
All ideas for a topic that night are more than welcome!
We have all been anxiously waiting for that special day on which we may kick some life into our community. For those who have no idea what I’m talking about: The Belgian BigData launch event will take place tomorrow in Ghent!
There are a few things we would like to talk about, but most importantly we want your feedback and brilliant ideas regarding BigData concepts, technologies and the Belgian community.
As you may know, the event will start tomorrow (July 5th) at 6:30 PM in the Atari room of the IBBT Zuiderpoort Office Park (Gaston Crommenlaan 8 (bus 102), Ghent). The following items are a rough outline of the evening:
- Members introduction
- Community brainstorm
- bbuzz debrief
21 members have already confirmed their attendance. If you are not one of them and you still want to join the event, you can do so on our meetup page. All information about the event can be found there, as well as the list of members who will join us tomorrow.
We are impatiently looking forward to meeting you.
See you tomorrow!
It is time to meet up in real life! IBBT in Ghent is kind enough to host our very first meeting and you are invited. We have created a meetup page and would like to ask you to subscribe if you are interested in joining us for an informal #bbuzz debrief and community introduction. The event is free of charge and should be seen as a first-contact event for our new little community.
If you are interested in big data and joining a brand new community, put your name on the attendee list!
Hope to see you there!
After two intensely immersive days on big data at the Berlin Buzzwords (#bbuzz) conference in Berlin, the bigdata.be crew is back in Belgium. Those two days were a rollercoaster ride of meeting smart people and listening to great talks on everything big data, NoSQL datastores and scalable search.
There were quite a few bigdata.be people present at the event: Andre Kelpe, Wim Van Leuven and Steven Noels (who presented on Lily) with two of his colleagues at Outerthought. Of course, we weren’t easy to spot amongst the crowd of about 450 data geeks, so we might have missed some other Belgians out there. If you were there or know somebody who was, please step forward and get in touch! We want you!
The conference itself was a great lineup of two keynotes and numerous sessions on three tracks (store, scale and search) by the most knowledgeable people in our specific domain. Check out the agenda for Monday and Tuesday! Slides are posted here as they become available.
There were a few presentations that kept simmering through the hallways. Jonathan Gray’s exposé on Realtime Big Data at Facebook with Hadoop and HBase was much discussed because it was a bit controversial. Remember that Apache Cassandra was developed at and open-sourced by Facebook in 2008.
Also, during the second-day keynote, Ted Dunning challenged the Hadoop community and, more generally, the Apache community. He postulated that its days as a community are over. Too many stakeholders participate with too many conflicting interests. For the first time in its existence, the Apache Software Foundation is confronted with a community on such a large scale. A community that is becoming an ecosystem. And the ASF is not the right body to manage it. A provocative but also challenging thought. But what is the right structure to manage that ecosystem? A Linux-like guerrilla approach? Or an overly structured standards body? These are some thoughts that I have been discussing with Lars George, but of course without finding an answer. There’s only a sense of future promises. A future that looks difficult but interesting …
Remember: whether you were there or not, if you are interested in big data, we want you! Get in touch and join!