Engineering Blog

Blog posts tagged 'Server Infrastructure'

Eran TalEngineering

OCP Summit 2017 — Facebook news recap

Posted about 5 months ago

This year's announcements included an end-to-end server refresh, product updates, and new partner collaborations. Read more...

Arlene Gabriana MurilloTechnical Program Manager at Facebook

The end-to-end refresh of our server hardware fleet

Posted about 5 months ago

Innovations across our storage and compute hardware enable our server fleet to scale and improve performance of Facebook's apps and services. Read more...

Kevin LeeEngineering

Introducing Big Basin: Our next-generation AI hardware

Posted about 5 months ago

With Big Basin, we can train machine learning models that are 30 percent larger, which enables our researchers and engineers to move more quickly in developing increasingly complex AI models. Read more...

Jason AdrianEngineering

Introducing Bryce Canyon: Our next-generation storage platform

Posted about 5 months ago

Bryce Canyon will primarily be used for high-density storage and is designed to support more powerful processors and more memory, while also improving thermal and power efficiency. Read more...

Divij RajkumarProduction Engineer at Facebook

Continuous MySQL backup validation: Restoring backups

Posted about 10 months ago

Our system continuously tests our ability to restore our databases from backups, ensuring that we can quickly and reliably recover from an outage. Read more...

Guilherme OttoniSoftware Engineer at Facebook

Redesigning the HHVM JIT compiler for better performance

Posted about 11 months ago

The redesign of the HHVM JIT has improved the CPU usage for running Facebook by about 15 percent. Read more...

Romain KomornEngineering

Making Facebook self-healing: Automating proactive rack maintenance

Posted about a year ago

Aggregate Maintenance Handlers provide a way to safely automate maintenance on multiple servers at once. Read more...

Facebook's new front-end server design delivers on performance without sucking up power

Posted about a year ago

The new design packs more than twice the compute capacity into each rack while maintaining the rack power budget. Read more...

Sai DasariEngineering

OpenBMC for server: Porting and supporting new features for “Yosemite”

Posted about 2 years ago

We've added a series of new features to the OpenBMC, most notably support for the multi-node 1S server platform code-named 'Yosemite.'. Read more...

Qiang WuInfrastructure Software Engineer at Facebook

Making Facebook’s software infrastructure more energy efficient with Autoscale

Posted about 3 years ago

Improving energy efficiency and reducing environmental impact as we scale is a top priority for our data center teams. We’ve talked a lot about our progress on energy-efficient hardware and data center design through the Open Compute Project, but we’ve also started looking at how we could improve the energy efficiency of our software. We explored multiple avenues, including power modeling and profiling, peak power management, and energy-proportional computing. One particularly exciting piece of technology that we developed is a system for power-efficient load balancing called Autoscale. Autoscale has been rolled out to production clusters and has already demonstrated significant energy savings. Read more...

Alex GartrellEngineering

McDipper: A key-value cache for Flash storage

Posted about 4 years ago
blog post · Infra · Data · Web · Storage · Caching · Performance · Server Infrastructure · Data Centers · Photos

Memcache has been used at Facebook for everything from a look-aside cache for MySQL to a semi-reliable data-store for ads impression data. Of course RAM is relatively expensive, and for working sets that had very large footprints but moderate to low request rates, we believed we could make Memcached much more efficient. Compared with memory, flash provides up to 20 times the capacity per server and still supports tens of thousands of operations per second, so it was the obvious answer to this problem. Read more...

Sean LynchEngineering

Monitoring cache with Claspin

Posted about 5 years ago

When I started at Facebook, I joined the newly formed cache performance team in production engineering. Our goal was to get a handle on the health of our various cache systems and to facilitate quick troubleshooting, starting with answering the question, "Is this problem being caused by the cache?". Read more...

MySQL and Database Engineering: Mark Callaghan

Posted about 5 years ago

Mark Callaghan joined Facebook in 2009. Now, working from home in Oregon on the database infrastructure team, he does whatever it takes to keep the database running and users like Trekkie George Takei, a favorite person to get updates from amongst Mark's team, happy. Read on for Mark's insights into fast-storage challenges, Facebook's MySQL roadmap, and more. Read more...

Alethea PowerEngineering

Making Facebook Self-Healing

Posted about 6 years ago

When your infrastructure is the size of Facebook’s, there are always broken servers and pieces of software that have gone down or are generally misbehaving. In most cases, our systems are engineered such that these issues cause little or no impact to people using the site. But sometimes small outages can become bigger outages, causing errors or poor performance on the site. If a piece of broken software or hardware does impact the site, then it's important that we fix it or replace it as quickly as possible. Even if it's not causing issues for users yet, it could in the future so we need to take care of it quickly. Read more...

Xin QiResearch Scientist at Facebook

HipHop for PHP: More Optimizations for Efficient Servers

Posted about 6 years ago

Facebook switched all its production servers to HipHop in early 2010, also releasing the project’s source code at that time. At the time of the switch, HipHop reduced our average CPU usage by 50%, the six months after its release saw an additional 1.8x performance improvement, and in the past six months the team in conjunction with the open source community has made an additional 1.7x improvement. Read more...

Scott MacVicarEngineering

HipHop for PHP: six months later

Posted about 7 years ago

It's been six months since we released HipHop and I wanted to share an update on its progress. In February we released 693,613 lines of source code which on average reduced our CPU usage here at Facebook by about 50%. Since February, the team has made HipHop another 1.8 times faster and all of that code is open source. We’ve also seen improvements to PHP itself with the additions to PHP's trunk in April being about 10% faster than 5.3. Read more...

Haiping ZhaoSenior Server Engineer at Facebook

Keep Updated

Stay up-to-date via RSS with the latest open source project releases from Facebook, news from our Engineering teams, and upcoming events.

Subscribe
Facebook © 2017