Loading…
October 25-27, 2017 - Prague, Czech Republic
Click Here For Information & Registration
Frameworks [clear filter]
Friday, October 27
 

09:45 CEST

DIY Mesos Executor - Tomasz Janiszewski, Allegro
The Mesos executor is a part of Mesos that could be replaced with custom implementation. The executor controls tasks lifecycle. In this talk I will present the benefits that comes from writing a custom executor.

Speakers
avatar for Tomasz Janiszewski

Tomasz Janiszewski

Software Engineer, Allegro
Tomasz is a software engineer passionate about distributed systems. He believes in free and open source philosophy and occasionally contributes to projects on GitHub. At Allegro he works as a Software Engineer working with Mesos and Marathon cluster.



Friday October 27, 2017 09:45 - 10:35 CEST
Congress Hall 2

11:00 CEST

More Bang for Your Buck: How Yelp Autoscales Mesos + Marathon on AWS Spot Fleet - Rob Johnson, Yelp
Yelp was an early adopter of Mesos and Marathon, building PaaSTA, a PaaS that provides an easy way for developers to deploy their services and batches. As we migrated more parts of the infrastructure to run on PaaSTA, we had to figure out how to maximize cluster utilization and minimize costs. In this talk, I'll discuss how Yelp autoscales both services and servers, shuffling tasks around our Mesos cluster to improve utilization, whilst dealing with the extra volatility caused by running on AWS Spot Fleet. I’ll tell stories of outages, strategies for improving resilience against AWS pulling the plug on instances with 2 minutes warning and gracefully migrating services actively serving traffic, and discuss how we decide when to increase and decrease cluster capacity.

Speakers
RJ

Rob Johnson

Site Reliability Engineer, Yelp inc.
Rob works as a Site Reliability Engineer on the Operations team at Yelp in London. Most of Rob's time is spent developing PaaSTA, Yelp’s internal platform-as-a-service, which runs nearly all of Yelp's production services. Rob has spoken at MesosCon previously about PaaSTA, and is... Read More →


Friday October 27, 2017 11:00 - 11:50 CEST
Congress Hall 2

12:00 CEST

Marathon and Jobs - Today and Tomorrow, Johannes Unterstein, Mesosphere, Inc.
Marathon and Chronos have been the core and essential frameworks for most Apache Mesos clusters. Marathon ensures long running tasks are running in the datacenter and is commonly used to maintain high availability of other frameworks. Chronos is the fault tolerant cron of the datacenter. Not long ago, Mesosphere introduced Metronome as a replacement for Chronos based on the Marathon code base. There has been significant change in Marathon to leverage the latest features of Apache Mesos, shift the framework to a reliable and stable state and improve its overall performance and scalability. This session will provide an overview of all the new features, a reasoning about how and why Marathon will grow, and will conclude with a look at the roadmap for Marathon in particular and frameworks launched via Marathon in general.

Speakers
avatar for Johannes Unterstein

Johannes Unterstein

Graphs, containers and fun, Neo4j
Johannes is doing things with containers and graphs, check it out: https://neo4j.com/cloud/



Friday October 27, 2017 12:00 - 12:50 CEST
Congress Hall 2

14:00 CEST

Seagull: A Distributed, Fault Tolerant, Concurrent Task Runner - Sagar Patwardhan, Yelp Inc.
At Yelp, we all strive to increase developer productivity by decreasing the time to test, deploy, and monitor changes. To enable developers to push code safely, we run more than 20 million tests every day. Yelp has a monolith web application which has 100,000 tests; running them sequentially takes approximately 2 days to finish. We built an in-house distributed system called Seagull, which splits these 100,000 tests into smaller chunks(bundles) using our bespoke algorithm and runs these chunks in parallel on the compute cluster to finish all the tests in less than 10 mins. Seagull uses Apache Mesos to schedule these run test bundles on AWS spotfleet. We have written a custom autoscaler for AWS spotfleet which dynamically adjusts the cluster capacity based on different utilization metrics for optimal use of resources. Seagull runs more than 2 million ephemeral docker containers every day.

Speakers
SP

Sagar Patwardhan

Software Engineer, Yelp inc.
My name is Sagar Patwardhan and I am a software engineer at Yelp Inc. I am part of the distributed systems(Infrastructure) team. I have been with Yelp for a little over two years; I am currently working on building mesos infrastructure and next-gen batch processing infrastructure... Read More →



Friday October 27, 2017 14:00 - 14:50 CEST
Congress Hall 2

15:00 CEST

A Year with Apache Aurora - Rick Mangi, Chartbeat
Chartbeat, a real time web publishing analytics platform made the decision last year to migrate the bulk of our workload from puppet managed AWS EC2 instances to Aurora. Today, the majority of our migration is done and this talk will focus on what we learned and the decisions made along the way, including what not to migrate to Aurora. We chose to adopt Aurora for a variety of reasons: cost savings from better resource utilization, consistent deployments and monitoring of services, streamlining of the development workflow and the ability to approach scaling our platform holistically. Aurora has helped us to reign in server sprawl and get a much better handle on our footprint. We don't claim to have done everything "right", but it works for us and that's all that matters.

Speakers
avatar for Rick Mangi

Rick Mangi

Director of Platform Eng. & DevOps, Chartbeat
Head of Platform Engineering and DevOps (Platopus) at Chartbeat, Rick is a 20 year industry veteran focused on startups in the publishing and media space. At Chartbeat, he has been focused on bringing a successful startup to the next level of efficiency and reliability with apache... Read More →



Friday October 27, 2017 15:00 - 15:50 CEST
Congress Hall 2

16:00 CEST

Disaster Recovery with a Distributed Database - Max Neunhöffer & Jörg Schad, Mesosphere, Inc.
Enterprises large and small want to have a disaster recovery plan at the ready when they are running distributed databases. The simplest such setup uses two database clusters in two different data centers with asynchronous replication of all updates. However, already this seemingly simple approach causes considerable head scratching for architects and developers of a distributed data store, considering fault tolerance, network failures, automatic failover and write load spikes.

This talk explains how ArangoDB implements asynchronous DC to DC replication between clusters using the Kafka message broker on both sides together with an incremental replication protocol and automatic write ahead log tailing. Apache Mesos plays a vital role in managing the deployment, managing resources, improve utilization, ensure scalability and ensuring liveness of the whole system.

Speakers
avatar for Max Neunhöffer

Max Neunhöffer

Database Developer, ArangoDB
Max Neunhöffer is a mathematician turned database developer. In his academic career he has worked for 16 years on the development and implementation of new algorithms in computer algebra. During this time he has juggled a lot with mathematical big data like group orbits containing... Read More →
avatar for Jörg Schad

Jörg Schad

CTO, ArangoDB
Jörg Schad is the CTO at ArangoDB. In a previous life, he has worked on or built machine learning pipelines in healthcare, distributed systems, including early Kubernetes code at Mesosphere, and in-memory databases. He received his Ph.D. for research about distributed databases and... Read More →



Friday October 27, 2017 16:00 - 16:50 CEST
Congress Hall 2
 
Filter sessions
Apply filters to sessions.