High performance *

Methods for obtaining systems high performance

Articles Posts News Authors

straykerwl Jan 19 2022 at 00:19

Comparing PHP-FPM, NGINX Unit, and Laravel Octane

12 min

22K

High performance*PHP*Programming*Web services testing*Microservices*

Translation

Comparing PHP-FPM, NGINX Unit, and Laravel Octane - what to choose for developing microservices.

+10

michael-filonenko Sep 1 2021 at 16:15

In-Memory Showdown: Redis vs. Tarantool

13 min

5.3K

VK corporate blogHigh performance*Database Administration*Tarantool*

In this article, I am going to look at Redis versus Tarantool. At a first glance, they are quite alike — in-memory, NoSQL, key value. But we are going to look deeper. My goal is to find meaningful similarities and differences, I am not going to claim that one is better than the other.

There are three main parts to my story:

We’ll find out what is an in-memory database, or IMDB. When and how are they better than disk solutions?
Then, we’ll consider their architecture. What about their efficiency, reliability, and scaling?
Then, we’ll delve into technical details. Data types, iterators, indexes, transactions, programming languages, replication, and connectors.

Feel free to scroll down to the most interesting part or even the summary comparison table at the very bottom and the article.

+16

Shapelez May 28 2021 at 17:52

Overview of Morris's counters

7 min

1.3K

Qrator Labs corporate blogHigh performance*Algorithms*Mathematics*

On implementing streaming algorithms, counting of events often occurs, where an event means something like a packet arrival or a connection establishment. Since the number of events is large, the available memory can become a bottleneck: an ordinary -bit counter allows to take into account no more than 2^n - 1 events.
One way to handle a larger range of values using the same amount of memory would be approximate counting. This article provides an overview of the well-known Morris algorithm and some generalizations of it.

Another way to reduce the number of bits required for counting mass events is to use decay. We discuss such an approach here [3], and we are going to publish another blog post on this particular topic shortly.

In the beginning of this article, we analyse one straightforward probabilistic calculation algorithm and highlight its shortcomings (Section 2). Then (Section 3), we describe the algorithm proposed by Robert Morris in 1978 and indicate its most essential properties and advantages. For most non-trivial formulas and statements, the text contains our proofs, the demanding reader can find them in the inserts. In the following three sections, we outline valuable extensions of the classic algorithm: you can learn what Morris's counters and exponential decay have in common, how to improve the accuracy by sacrificing the maximum value, and how to handle weighted events efficiently.

+12

codesign Nov 26 2020 at 21:49

How to build a high-performance application on Tarantool from scratch

33 min

2.8K

VK corporate blogHigh performance*Algorithms*Lua*Tarantool*

Tutorial

I came to Mail.ru Group in 2013, and I required a queue for one task. First of all, I decided to check what the company had already got. They told me they had this Tarantool product, and I checked how it worked and decided that adding a queue broker to it could work perfectly well.

I contacted Kostja Osipov, the senior expert in Tarantool, and the next day he gave me a 250-string script that was capable of managing almost everything I needed. Since that moment, I have been in love with Tarantool. It turned out that a small amount of code written with a quite simple script language was capable of ensuring some totally new performance for this DBMS.

Today, I’m going to tell you how to instantiate your own queue in Tarantool 2.2.

+18

vovkins Jun 26 2020 at 11:31

Making a Tarantool-Based Investment Business Core for Alfa-Bank

10 min

1.8K

VK corporate blogHigh performance*System Analysis and Design*Database Administration*Tarantool*

A still from «Our Secret Universe: The Hidden Life of the Cell»

Investment business is one of the most complex domains in the banking world. It's about not just credits, loans, and deposits — there are also securities, currencies, commodities, derivatives, and all kinds of complex stuff like structured products.

Recently, people have become increasingly aware of their finances. More and more get involved in securities trading. Individual investment accounts have emerged not so long ago. They allow you to trade in securities and get tax credits or avoid taxes at the same time. All clients coming to us want to manage their portfolios and see their reporting on-line. Most frequently, these are multi-product portfolios, which means that people are clients of different business areas.

Moreover, the demands of regulators, both Russian and international, also grow.

To meet the current needs and lay a foundation for future upgrades, we've developed our Tarantool-based investment business core.

+14

dokshina Apr 13 2020 at 14:34

Deploying Tarantool Cartridge applications with zero effort (Part 2)

11 min

1.4K

VK corporate blogHigh performance*Database Administration*Distributed systems*Tarantool*

Tutorial

We have recently talked about how to deploy a Tarantool Cartridge application. However, an application's life doesn't end with deployment, so today we will update our application and figure out how to manage topology, sharding, and authorization, and change the role configuration.

Feeling interested? Please continue reading under the cut.

+15

fessoga5 Feb 25 2020 at 19:19

Fault Tolerance Web Architecture for Our Cloud Solutions

10 min

VK corporate blogHigh performance*Virtualization*Cloud computing*Cloud services*

Hi Habr,

I'm Artyom Karamyshev, a system administration team leader at Mail.Ru Cloud Solutions (MCS). We launched many products in 2019. We've aimed to make API services easily scalable, fault-tolerant, and ready to accommodate rapid growth. Our platform is running on OpenStack, and in this article, I describe all the component fault tolerance issues that we've resolved.

The overall fault tolerance of the platform is consists of its components fault tolerance. So, I'm going to show you step by step tutorial about all levels where we've found the risks.

+23

gen Dec 18 2019 at 12:48

Accelerating PHP connectors for Tarantool using Async, Swoole, and Parallel

6 min

2.2K

VK corporate blogHigh performance*PHP*SQL*NoSQL*

In the PHP ecosystem, there are currently two connectors for the Tarantool server: the official PECL extension tarantool/tarantool-php written in C, and tarantool-php/client written in PHP. I am the author of the latter one.

In this article I would like to share the results of performance testing of both these libraries and show how you can achieve 3x-5x performance improvement (on synthetic tests!) with minimal changes in code.

+39

dokshina Dec 16 2019 at 12:41

Deploying Tarantool Cartridge applications with zero effort (Part 1)

8 min

1.9K

VK corporate blogHigh performance*IT Infrastructure*Database Administration*Distributed systems*

We have already presented Tarantool Cartridge that allows you to develop and pack distributed applications. Now let's learn how to deploy and control these applications. No panic, it's all under control! We have brought together all the best practices of working with Tarantool Cartridge and wrote an Ansible role, which will deploy the package to servers, start and join instances into replica sets, configure authorization, bootstrap vshard, enable automatic failover and patch cluster configuration.

Interesting, huh? Dive in, check details under the cut.

+29

vasiliy-t Oct 21 2019 at 19:09

Tarantool Kubernetes Operator

10 min

1.8K

VK corporate blogHigh performance*Database Administration*Kubernetes*Tarantool*

Kubernetes has already become a de-facto standard for running stateless applications, mainly because it can reduce time-to-market for new features. Launching stateful applications, such as databases or stateful microservices, is still a complex task, but companies have to meet the competition and maintain a high delivery rate. So they create a demand for such solutions.

We want to introduce our solution for launching stateful Tarantool Cartridge clusters: Tarantool Kubernetes Operator, more under the cut.

+34

mt144 Oct 16 2019 at 15:10

Tarantool Data Grid: Architecture and Features

6 min

2.1K

VK corporate blogHigh performance*Lua*Data storage*Tarantool*

In 2017, we won the competition for the development of the transaction core for Alfa-Bank's investment business and started working at once. (Vladimir Drynkin, Development Team Lead for Alfa-Bank's Investment Business Transaction Core, spoke about the investment business core at HighLoad++ 2018.) This system was supposed to aggregate transaction data in different formats from various sources, unify the data, save it, and provide access to it.

In the process of development, the system evolved and extended its functions. At some point, we realized that we created something much more than just application software designed for a well-defined scope of tasks: we created a system for building distributed applications with persistent storage. Our experience served as a basis for the new product, Tarantool Data Grid (TDG).

I want to talk about TDG architecture and the solutions that we worked out during the development. I will introduce the basic functions and show how our product could become the basis for building turnkey solutions.

+37

Rosik Oct 9 2019 at 17:32

Тarantool Cartridge: Sharding Lua Backend in Three Lines

8 min

2.4K

VK corporate blogHigh performance*Lua*Distributed systems*Tarantool*

In Mail.ru Group, we have Tarantool, a Lua-based application server and a database united. It's fast and classy, but the resources of a single server are always limited. Vertical scaling is also not the panacea. That is why Tarantool has some tools for horizontal scaling, or the vshard module [1]. It allows you to spread data across multiple servers, but you'll have to tinker with it for a while to configure it and bolt on the business logic.

Good news: we got our share of bumps (for example, [2], [3]) and created another framework, which significantly simplifies the solution to this problem.

Тarantool Cartridge is the new framework for developing complex distributed systems. It allows you to concentrate on writing business logic instead of solving infrastructure problems. Under the cut, I will tell you how this framework works and how it could help in writing distributed services.

+24

kaiuk Sep 10 2019 at 14:27

Enumerable: How to yield a business value

6 min

2.1K

High performance*.NET*C#*ООP*Microservices*

This article is a brief explanation about how using a common language keywords might have an influence on the budget of IT-infrastructure of a project or help to achieve some limitations/restrictions of hosting infrastructure and, moreover, will be a good sing of the quality and maturity of the source code.

+10

o6CuFl2Q Jun 25 2019 at 17:42

How to speed up LZ4 decompression in ClickHouse?

23 min

15K

Яндекс corporate blogHigh performance*Open source*C++*Big Data*

When you run queries in ClickHouse, you might notice that the profiler often shows the LZ_decompress_fast function near the top. What is going on? This question had us wondering how to choose the best compression algorithm.

ClickHouse stores data in compressed form. When running queries, ClickHouse tries to do as little as possible, in order to conserve CPU resources. In many cases, all the potentially time-consuming computations are already well optimized, plus the user wrote a well thought-out query. Then all that's left to do is to perform decompression.

So why does LZ4 decompression becomes a bottleneck? LZ4 seems like an extremely light algorithm: the data decompression rate is usually from 1 to 3 GB/s per processor core, depending on the data. This is much faster than the typical disk subsystem. Moreover, we use all available CPU cores, and decompression scales linearly across all physical cores.

+19

danikin May 13 2019 at 12:02

Citymobil — a manual for improving availability amid business growth for startups. Part 5

8 min

VK corporate blogHigh performance*System Analysis and Design*Start-up developmentProduct Management*

This is the final part of the series describing how we’re increasing our service availability in Citymobil (you can read the previous part here). Now I’m going to talk about one more type of outages and the conclusions we made about them, how we modified the development process, what automation we introduced.

+24

danikin May 7 2019 at 12:43

Citymobil — a manual for improving availability amid business growth for startups. Part 4

7 min

VK corporate blogHigh performance*System Analysis and Design*Start-up developmentProduct Management*

This is the next article of the series describing how we’re increasing our service availability in Citymobil (you can read the previous parts here: part 1, part 2, part 3). In further parts, I’ll talk about the accidents and outages in detail.

1. Bad release: database overload

Let me begin with a specific example of this type of outage. We deployed an optimization: added USE INDEX in an SQL query; during testing as well as in production, it sped up short queries, but the long ones — slowed down. The long queries slowdown was only noticed in production. As a result, a lot of long parallel queries caused the database to be down for an hour. We thoroughly studied the way USE INDEX worked; we described it in the Do’s and Dont’s file and warned the engineers against the incorrect usage. We also analyzed the query and realized that it retrieves mostly historical data and, therefore, can be run on a separate replica for historical requests. Even if this replica goes down due to an overload, the business will keep running.

+15

danikin Apr 29 2019 at 11:51

Citymobil — a manual for improving availability amid business growth for startups. Part 3

8 min

1.1K

VK corporate blogHigh performance*System Analysis and Design*Start-up developmentProduct Management*

This is the next article of the series describing how we’re increasing our service availability in Citymobil (you can read the previous parts here and here). In further parts, I’ll talk about the accidents and outages in detail. But first let me highlight something I should’ve talked about in the first article but didn’t. I found out about it from my readers’ feedback. This article gives me a chance to fix this annoying shortcoming.

+23

morozovsk Apr 29 2019 at 01:15

Statistics and monitoring of PHP scripts in real time. ClickHouse and Grafana go to Pinba for help

6 min

5.4K

High performance*PHP*

Tutorial

In this article I will explain how to use pinba with clickhouse and grafana instead of pinba_engine and pinboard.

On the php project pinba is probably the only reliable way to understand what is happening with performance. But usually people start to use pinba only when problems are already observed and it isn't clear where to look in.

Often developers have no idea how many RPS each script has. So they begin to optimize starting from places that seem to have problem.

Someone is analyzing the nginx logs, and someone is slow queries in the database.

Of course pinba would not be superfluous, but there are several reasons why it is not on every project.

+14

danikin Apr 25 2019 at 12:03

Citymobil — a manual for improving availability amid business growth for startups. Part 2

8 min

977

VK corporate blogHigh performance*System Analysis and Design*Start-up developmentProduct Management*

This is a second article out of a series «Citymobil — a manual for improving availability amid business growth for startups». You can read the first part here. Let’s continue to talk about the way we managed to improve the availability of Citymobil services. In the first article, we learned how to count the lost trips. Ok, we are counting them. What now? Now that we are equipped with an understandable tool to measure the lost trips, we can move to the most interesting part — how do we decrease losses? Without slowing down our current growth! Since it seemed to us that the lion’s share of technical problems causing the trips loss had something to do with the backend, we decided to turn our attention to the backend development process first. Jumping ahead of myself, I’m going to say that we were right — the backend became the main site of the battle for the lost trips.

+21

danikin Apr 22 2019 at 18:52

Citymobil — a manual for improving availability amid business growth for startups. Part 1

4 min

1.3K

VK corporate blogHigh performance*System Analysis and Design*Start-up developmentProduct Management*

In this first part of an article series «Citymobil — a manual for improving availability amid business growth for startups» I’m going to break down the way we managed to dramatically scale up the availability of Citymobil services. The article opens with the story about our business, our task, the reason for this task to increase the availability emerged and limitations. Citymobil is a rapid-growing taxi aggregator. In 2018, it increased by more than 15 times in terms of number of successfully completed trips. Some months showed 50% increase compared with the previous month.

The business grew like a weed in every direction (it still does): there was an increase in server load, team size and number of deployments. At the same time the new threats to service availability emerged. The company faced a task of the most importance — how to increase availability without compromising company growth. In this article, I’ll talk about the way we managed to solve this task in a relatively short time.

+24