Parallel LINQ in Depth (4) Performance

Tuesday, September 24, 2019

C# .NET LINQ PLINQ Parallel LINQ Parallel Computing

[LINQ via C# series]

[Parallel LINQ in Depth series]

The purpose of PLINQ is to utilize multiple CPUs for better performance than LINQ to Objects However, PLINQ can also introduces performance overhead, like source partitioning and result merging. There are many aspects that impact PLINQ query performance.

Sequential query vs. parallel query

To compare the performance of sequential and parallel query, take OrderBy query as example. PLINQ’s OrderBy requires partitioning the source, as well as buffering and merging the results. The following function compares the query execution duration of sequential OrderBy and parallel OrderBy:

internal static void OrderByTest(

Func<int, int> keySelector, int sourceCount, int testRepeatCount)

{

int[] source = EnumerableX

.RandomInt32(min: int.MinValue, max: int.MaxValue, count: sourceCount)

.ToArray();

Stopwatch stopwatch = Stopwatch.StartNew();

Enumerable.Range(0, testRepeatCount).ForEach(_ =>

{

int[] sequentialResults = source.OrderBy(keySelector).ToArray();

});

stopwatch.Stop();

$"Sequential:{stopwatch.ElapsedMilliseconds}".WriteLine();

stopwatch.Restart();

Enumerable.Range(0, testRepeatCount).ForEach(_ =>

{

int[] parallel1Results = source.AsParallel().OrderBy(keySelector).ToArray();

});

stopwatch.Stop();

$"Parallel:{stopwatch.ElapsedMilliseconds}".WriteLine();

}

It calls the RandomInt32 query, which is defined in the LINQ to Objects custom queries chapter, to generate an array of random int values with the specified length. Then it executes the sequential and parallel OrderBy queries repeatedly for the specified times, so that the total execution time can be controlled in a reasonable range, the following code calls the OrderByTest function compares the sequential/parallel OrderBy execution on arrays of small/medium/large size, with the same simple key selector:

internal static void OrderByTestForSourceCount()

{

OrderByTest(keySelector: value => value, sourceCount: 5, testRepeatCount: 10_000);

// Sequential:11 Parallel:1422

OrderByTest(keySelector: value => value, sourceCount: 5_000, testRepeatCount: 100);

// Sequential:114 Parallel:107

OrderByTest(keySelector: value => value, sourceCount: 500_000, testRepeatCount: 100);

// Sequential:18210 Parallel:8204

}

The following code compares the sequential/parallel OrderBy execution on arrays of the same size, with different key selector of light/medium/heavy workload:

internal static void OrderByTestForKeySelector()

{

OrderByTest(

keySelector: value => value + ComputingWorkload(baseIteration: 1),

sourceCount: Environment.ProcessorCount, testRepeatCount: 100_000);

// Sequential:37 Parallel:2218

OrderByTest(

keySelector: value => value + ComputingWorkload(baseIteration: 10_000),

sourceCount: Environment.ProcessorCount, testRepeatCount: 1_000);

// Sequential:115 Parallel:125

OrderByTest(

keySelector: value => value + ComputingWorkload(baseIteration: 100_000),

sourceCount: Environment.ProcessorCount, testRepeatCount: 100);

// Sequential:1240 Parallel:555

}

It turns out PLINQ has better performance than LINQ to Objects with larger source and expensive iteratee function, which has a better chance to offset the overhead of partitioning and buffering/merging.

CPU bound operation vs. I/O bound operation

So far, all the examples are CPU bound operations. In most cases, PLINQ by default takes the logic processor count as the degree of parallelism. This makes sense for CPU bound operations, but may be not ideal for I/O bound operations. For example, when downloading files from Internet with parallel threads, it could be nice if the worker thread count can be controlled accurately disregarding the CPU core count. The following ForceParallel extension method can be implementation for this purpose:

internal static void ForceParallel<TSource>(

this IEnumerable<TSource> source, Action<TSource> iteratee, int degreeOfParallelism)

{

if (degreeOfParallelism <= 1)

{

throw new ArgumentOutOfRangeException(nameof(degreeOfParallelism));

}

IList<IEnumerator<TSource>> partitions = Partitioner

.Create(source, EnumerablePartitionerOptions.NoBuffering) // Stripped partitioning.

.GetPartitions(degreeOfParallelism);

ConcurrentBag<Exception> exceptions = new ConcurrentBag<Exception>();

void IteratePartition(IEnumerator<TSource> partition)

{

try

{

using (partition)

{

while (partition.MoveNext())

{

iteratee(partition.Current);

}

catch (Exception exception)

{

exceptions.Add(exception);

}

Thread[] threads = partitions

.Skip(1)

.Select(partition => new Thread(() => IteratePartition(partition)))

.ToArray();

threads.ForEach(thread => thread.Start());

IteratePartition(partitions[0]);

threads.ForEach(thread => thread.Join());

if (!exceptions.IsEmpty)

{

throw new AggregateException(exceptions);

}

It calls Partitioner.Create with EnumerablePartitionerOptions.NoBuffering, to enable stripped partitioning for better load balance. It then calls the created partitioner to create the specified number of partitions, and uses current thread and additional threads to simultaneously pull each partition and call the iterate function.

To demonstrate the I/O bound operation, the following function first visualizes sequential download, then visualizes parallel download with PLINQ, and finally visualizes parallel download with above ForceParallel function. Again, assuming a quad core CPU, the degree of parallelism is specified as 10, which is higher than the core count:

internal static void DownloadTest(string[] uris)

{

byte[] Download(string uri)

{

using (WebClient webClient = new WebClient())

{

return webClient.DownloadData(uri);

}

uris.Visualize(EnumerableEx.ForEach, uri => Download(uri).Length.WriteLine());

const int DegreeOfParallelism = 10;

uris.AsParallel()

.WithDegreeOfParallelism(DegreeOfParallelism)

.Visualize(ParallelEnumerable.ForAll, uri => Download(uri).Length.WriteLine());

uris.Visualize(

query: (source, iteratee) => source.ForceParallel(iteratee, DegreeOfParallelism),

iteratee: uri => Download(uri).Length.WriteLine());

}

The following code queries some thumbnail picture file URIs from the Flickr RSS feed with LINQ to XML, then pass the URIs to above function to visualize the download:

internal static void RunDownloadTestWithSmallFiles()

{

string[] smallThumbnailUris = XDocument

.Load("https://www.flickr.com/services/feeds/photos_public.gne?id=64715861@N07&format=rss2")

.Descendants((XNamespace)"http://search.yahoo.com/mrss/" + "thumbnail")

.Attributes("url")

.Select(uri => (string)uri)

.ToArray();

DownloadTest(smallThumbnailUris);

}

Here sequential download takes longer time, as expected. The PLINQ query is specified with a max degree of parallelism 10, but it decides to utilize 5 threads. ForceParallel starts 10 threads exactly as specified, and its execution time is about half of PLINQ.

The following code queries for the same Flickr RSS feed, but for large picture file URIs, and visualize the download:

internal static void RunDownloadTestWithLargeFiles()

{

string[] largePictureUris = XDocument

.Load("https://www.flickr.com/services/feeds/photos_public.gne?id=64715861@N07&format=rss2")

.Descendants((XNamespace)"http://search.yahoo.com/mrss/" + "content")

.Attributes("url")

.Select(uri => (string)uri)

.ToArray();

DownloadTest(largePictureUris);

}

This time PLINQ still utilizes 5 threads from the beginning, then decides to start 2 more threads a while later. ForceParallel simply start 10 threads since the beginning. However, the duration of sequential download, PLINQ download, and ForceParallel download are about the same. This is because when downloading larger files, the network bandwidth is fully occupied and becomes the performance bottleneck, so the degree of parallelism does not make much difference.

Factors to impact performance

This part and the previous parts have demonstrated many aspects that can have performance impact for PLINQ, and here is a summary:

· The partitioning strategy can impact performance, because different partitioning algorithms introduce different synchronization and load balance.

· The 2 execution modes, Default (sequential or parallel) and ForceParallel, can result different performance

· The degree of parallelism can impact performance, when degree of parallelism is set to 1, PLINQ works like sequential LINQ to Object.

· The merge option can also impact performance, smaller buffer size can have the early value results available faster, but can also make the query execute longer.

· The order preservation can impact the performance, query as unordered can have better performance, but can lead to incorrect results.

· The source size can impact performance, for source with smaller size, the overhead of parallelization can be more significant, and result even lower performance than sequential query.

· The iteratee function provided to query can impact performance, more expensive iteratee functions can have better performance with parallel queries.

· The type of operation can impact performance, utilize more CPU cores can improve the performance of compute bound operation, but I/O bound operations can also depend on the I/O hardware.

In the real world, the performance of each PLINQ query has to be measured and optimized accordingly.

Summary

PLINQ’s query execution performance is impacted by many aspects. First PLINQ partitions source for parallel query. PLINQ implements range partitioning, chuck partitioning, hash partitioning, and stripped partitioning. These partitioning algorithms require different synchronization, and result different load balance among the query threads to impact the overall performance. .NET Standard also provides APIs to define custom static, dynamic, and orderable partitioners. To utilize multi-processor and offset the overhead of partitioning and merging, PLINQ can have better performance with larger size source and more expensive iteratee function. PLINQ’s query performance also should be optimized according to the type of operation.

44 Comments

Thank you for another interesting article Dixin. You ought to publish this material somehow. Speaking of flickr, I also like your flickrstream and the pictures of Mt Rainer. Keep the articles and photos coming!

John Grant - Thursday, May 12, 2016 9:52:45 AM

آکادمی فوتبال کتاب به صورت تخصصی و حرفه ای آموزش فوتبال را در رده سنی پایه می‌پردازد و علاوه بر یادگیری مهارت‌های فوتبال به آموزش دروس مدرسه در تمامی مقاطع تحصیلی ورزشجویان در راستای پرورش و بهره‌وری بیشتر از مهارت های فردی می پردازد.

https://fa-ketab.com

آکادمی فوتبال کتاب - Thursday, May 6, 2021 1:46:52 AM

تاکسی vip با کیفیت عالی قیمت مناسب آرامش و امنیت را برای مسافران خود به ارمغان می آورد، ما هر روزه هفته ۲۴ ساعته خدمتگذار شما عزیزان هستیم. از مزایای تاکسی vip می توان به حضور رانندگان مجرب و با سابقه شرکت تاکسیرانی، احساس راحتی و آرامش و استفاده از اتومبیلهای با ضریب امنیت و کیفیت بالا اشاره کرد.

شرکت تاکسی vip - Thursday, May 6, 2021 1:56:59 AM

Thank you for another interesting article Dixin.

سعر فورتشنر 2022 - Tuesday, September 14, 2021 10:35:17 PM

https://ma-study.blogspot.com/

medicalphd - Monday, December 13, 2021 11:33:39 AM

Some people argue that education is also wealth. Without good education, it becomes very difficult for anyone to devise ways to generate money. These days, adults in need of returning to school are required to meet certain conditions before they are allowed to attend classes.

writemyessay.nyc - Thursday, April 28, 2022 9:44:51 AM

Thanks for this great post. I look forward to your next posts

friday night funkin - Friday, June 3, 2022 8:21:08 PM

From the middle peasants of the online sports

1xbet Apk download - Thursday, July 28, 2022 2:06:03 AM

I've been looking for photos and articles on this topic over the past few days due to a school assignment, Keonhacai and I'm really happy to find a post with the material I was looking for! I bookmark and will come often! Thanks :D

Keonhacai - Wednesday, August 3, 2022 6:36:46 PM

En Chile 1xbet apuestas deportivas

1xbet apuestas deportivas - Saturday, September 3, 2022 4:40:27 PM

Conoce más acerca la Inkabet APP, que llegó para que tus apuestas se puedan hacer donde y cuando quieras sin excusas.

Inkabet APP - Wednesday, October 12, 2022 7:32:41 AM

Betsson Perú cuenta con una gran variedad de juegos para apostar y obtener ganancias, mientras se disfruta de una experiencia muy distinta a lo que se vive en los casinos tradicionales.

Betsson - Friday, October 14, 2022 7:21:12 AM

Por un lado, están los inkabet bonos, que generalmente son: Inkabet Bono casino, Inkabet bono online e Inkabet bono apuestas deportivas.

inkabet bonos - Saturday, October 15, 2022 4:05:53 PM

Los slots de Betsson Casino te brindan mucha calidad y variedad para elegir.

betsson casino - Sunday, October 16, 2022 8:13:06 AM

I came to this site with the introduction of a friend around me and I was very impressed when I found your writing. I'll come back often after bookmarking! casinosite

casinosite - Tuesday, October 25, 2022 1:58:34 AM

le permiten experimentar plenamente la emoción del deporte, ya que puede elegir entre cientos de tipos diferentes de apuestas que se realizarán en varias disciplinas durante el transcurso de los eventos.

Apuestas en vivo - Sunday, November 6, 2022 3:28:43 PM

Una de las formas que tienes de comenzar tu camino en el mundo de las apuestas deportivas

Betsson Colombia - Sunday, November 6, 2022 3:29:41 PM

Haz clic sobre el enlace de descarga

Betsson App - Sunday, November 6, 2022 3:30:37 PM

Your expertise in this area is truly admirable, and I'm sure your book has provided valuable insights and knowledge to readers who are interested in parallel programming with LINQ. Thank you for sharing your expertise and contributing to the field of computer science.

Watch AEW Rampage Online - Sunday, April 16, 2023 10:18:56 PM

Everything looks different but interesting in itself. Are you really the one who wrote it? Very good.

ติดต่อ bet game - Friday, June 16, 2023 7:06:25 PM

I was just amazed to see your post. I got my whole attention. I just can’t explain how much informative this was.

สล็อตคลิปโต - Thursday, August 17, 2023 7:47:07 PM

با توجه به این موضوع که بسیاری از افراد درگیر مشغله های کاری و تحصیلی هستند ترجیح می دهند برای مراقبت و نگهداری از سالمندشان پرستار ساعتی سالمند در منزل استخدام کنند.

پرستار ساعتی سالمند - Tuesday, October 24, 2023 12:33:50 PM

The assignment submission period was over and I was nervous, <a href="https://images.google.com.vc/url?sa=t&url=https%3A%2F%2Fwww.mtclean.blog/">safetoto</a> and I am very happy to see your post just in time and it was a great help. Thank you ! Leave your blog address below. Please visit me anytime.

safetoto - Wednesday, November 22, 2023 1:49:55 AM

I am very impressed with your writing <a href="https://images.google.com.uy/url?sa=t&url=https%3A%2F%2Fwww.mtclean.blog/">casino online</a> I couldn't think of this, but it's amazing! I wrote several posts similar to this one, but please come and see!

casino online - Wednesday, November 22, 2023 1:53:12 AM

Hello ! I am the one who writes posts on these topics <a href="https://images.google.com.ua/url?sa=t&url=https%3A%2F%2Fwww.mtclean.blog/">totosite</a> I would like to write an article based on your article. When can I ask for a review?

totosite - Wednesday, November 22, 2023 1:53:37 AM

I was looking for another article by chance and found your article <a href="https://images.google.com.tw/url?sa=t&url=https%3A%2F%2Fwww.mtclean.blog/">baccarat online</a> I am writing on this topic, so I think it will help a lot. I leave my blog address below. Please visit once.

baccarat online - Wednesday, November 22, 2023 1:54:02 AM

You write a lot of articles for me. And every story makes me unable to stop reading. you are awesome I will keep reading your articles.

ติดต่อ Asia Gaming - Friday, November 24, 2023 9:16:20 PM

Thanks for sharing this information with us. This is a fantastic website.

สล็อตออนไลน์ - Monday, December 18, 2023 11:34:27 PM

The writing is even-handed, noting where the claim in the film.

Watch Wrestling - Saturday, January 20, 2024 4:12:43 AM

I admire your ability to distill complex ideas into clear and concise prose.

ติดต่อสอบหวยบี - Wednesday, June 5, 2024 1:17:58 AM

Your reading thinking ability must be great. therefore can be conveyed in this article.

ทางเข้า หวย ธกส - Monday, June 24, 2024 9:27:46 PM

Your unique perspective adds depth and richness to the discourse surrounding [topic of the article].

มาเล 4d - Monday, July 8, 2024 8:13:15 PM

hey good one

crypto sports betting - Sunday, October 6, 2024 9:31:51 AM

Good UX design ensures that products are easy to use and navigate. This reduces frustration and enhances the overall functionality, making it more likely that users will successfully achieve their goals with the product.

Bli med på NorskeCasinoeronlien.com - Friday, November 1, 2024 3:23:57 PM

An exceptional article! The content is well-researched, informative, and presented in a very engaging way. I really appreciate how the author made a complicated subject easy to understand.

Engineering Assignment Help - Thursday, February 13, 2025 6:27:29 AM

PLINQ aims to harness multiple CPUs for superior performance compared to LINQ to Objects. However, while it can boost efficiency, it may introduce overhead from source partitioning and result merging. Just like in Funny Shooter 2, where strategy is key to success, understanding these performance factors is essential for optimizing PLINQ queries.

Funny Shooter 2 - Tuesday, February 25, 2025 12:14:24 AM

Let me read your articles every day. Because it's really fun.

แนะนำเพื่อน wm casino - Tuesday, March 11, 2025 4:38:06 PM

Edit videos easily with KineMasterAPK.

kinemaster mod apk download - Sunday, April 27, 2025 5:43:23 AM

So, it turns out that sometimes running parallel queries can be like trying to herd cats—totally chaotic but occasionally worth it. Speaking of chaos, I can't help but wonder how 'I Want to Love You Till Your Dying Day' manages its plot twists with all that intensity! Check out <a href="https://iwanttoloveyoutillyourdyingday.com/" >I Want to Love You Till Your Dying Day</a>.

I Want to Love You Till Your Dying Day - Tuesday, May 27, 2025 11:13:04 PM

PLINQ might seem like the underdog hero, battling its own overheads, but you know what they say—every dog has its day. Just like every football match in 'Football Bros' has that one moment of glory! Check it out .

Football Bros - Tuesday, May 27, 2025 11:13:38 PM

It’s hilarious how parallel queries can sometimes be slower than sequential ones, kind of like how I am at playing games—it’s all fun until I realize I’ve been outplayed! For a different kind of fun, have you tried 'Funny Shooter 2'? It sure does keep you on your toes! Check .

Funny Shooter 2 - Tuesday, May 27, 2025 11:13:53 PM

With PLINQ’s quirks and performance impacts, it feels like watching a sports drama unfold—full of highs and lows! Speaking of drama, 'Rooftop Snipers' has its own share of tense moments that keep you on the edge of your seat! Don't miss it .

Rooftop Snipers - Tuesday, May 27, 2025 11:14:12 PM

Trying to balance performance in parallel queries is like sledding downhill—thrilling but you might end up crashing! For another adrenaline rush, check out 'Snow Rider 3D'—it'll take you on a wild ride! Look here .

Snow Rider 3D - Tuesday, May 27, 2025 11:14:25 PM

PLINQ performance hinges on many factors! Partitioning, execution mode, and parallelism degree all play key roles. Even seemingly minor choices, like merge options and order preservation, matter. Small source sizes might ironically suffer from parallelization overhead. Think optimizing a game, like tweaking settings to survive the night in Fnaf – every adjustment counts to avoid jumpscares and maximize efficiency! Expensive iteratee functions benefit more from parallel queries, while I/O bound operations depend on hardware. Testing is crucial.

Fnaf - Tuesday, June 10, 2025 7:29:56 PM

Dixin's Blog

[LINQ via C# series]

[Parallel LINQ in Depth series]

Sequential query vs. parallel query

CPU bound operation vs. I/O bound operation

Factors to impact performance

Summary

44 Comments