
Code Optimization Practices

Discussion in 'Scripting' started by hamsterbytedev, May 10, 2015.


How familiar are you with code optimization?

  1. Very familiar; I always keep optimization in mind and try to use the best practices.

    20 vote(s)
    48.8%
  2. Somewhat familiar; I know it is a big deal and I do what I can.

    12 vote(s)
    29.3%
  3. I've heard of it; I know what it is, I know it is a big deal, but I've not done much.

    5 vote(s)
    12.2%
  4. What's optimization? Bro, I have Quad SLI Titan X's, 64GB of DDR4, and a 5970K. Who cares!

    4 vote(s)
    9.8%
  1. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    Preamble

    Code optimization is very important for developers, especially video game developers. In a world where reflexes and timing matter so much, you have to keep optimization in mind at all times. There are good practices and bad practices for optimizing your code.

    In providing some support on another thread I received some information regarding a more optimized approach to shifting the contents of an array. Another community member, @steego, brought several variations of this sort of data manipulation to my attention.

    With my curiosity piqued, I set about writing a few classes to test his suggestions and get a clear illustration of which approach is actually the most efficient, but I digress. Who cares how this all came about? You just want to optimize your code and get back to business, right? Let's continue!

    Question

    What is the most efficient way to generate and shift a collection of integers? What is the best way to store these integers? Moreover, what is the best way to test the efficiency of a block of code and compare it against other solutions that produce the same result?


    Methodology

    I used two custom classes to test this. I set these classes up in such a way that you could simply place a method before and after your block of code and those methods work in tandem to return the time elapsed between them to the debug console. Simple enough, right?

    The first class I'm going to outline is the static class that I used to call these methods. It just contains a Dictionary and two methods.

    Code (CSharp):
    using System.Collections.Generic;
    using hamsterbyte.MethodTimer;

    public static class MethodTimer {

        public static Dictionary<string, MethodTimerObject> methodDictionary;

        public static void Start(string name) {
            if (methodDictionary == null)
                methodDictionary = new Dictionary<string, MethodTimerObject>();

            // Indexer assignment instead of Add() so calling Start twice
            // with the same name replaces the timer rather than throwing.
            methodDictionary[name] = new MethodTimerObject(name);
            methodDictionary[name].Start();
        }

        public static void End(string name) {
            if (methodDictionary != null && methodDictionary.ContainsKey(name)) {
                methodDictionary[name].End();
                methodDictionary.Remove(name);
            }
        }
    }
    The second class is the MethodTimerObject itself. This class serves as a container for individual timers and their data/methods.
    Code (CSharp):
    using System.Diagnostics;

    namespace hamsterbyte.MethodTimer
    {
        public class MethodTimerObject
        {
            private Stopwatch _stopWatch;
            private string _name;

            public string Name { get { return _name; } set { _name = value; } }

            public MethodTimerObject (string name)
            {
                _name = name;
            }

            public void Start ()
            {
                _stopWatch = new Stopwatch();
                _stopWatch.Start();
            }

            public void End ()
            {
                // Stop before reading the elapsed time.
                _stopWatch.Stop();
                UnityEngine.Debug.Log("\"" + _name + "\" completed in " + _stopWatch.ElapsedMilliseconds + "ms");
            }
        }
    }
    Lastly, I used a basic MonoBehaviour to test and interact with these classes and the data in question. I used a few different data containers: LinkedList<int>, Queue<int>, and of course an int[] array.

    The first thing I did was write methods to populate each of these containers with 5 million random integers. Then I shifted the data in each of these containers once to the left using several different methods.

    Code (CSharp):
    using UnityEngine;
    using System;
    using System.Collections.Generic;

    public class Example : MonoBehaviour {

        Queue<int> _theQueue = new Queue<int>();
        LinkedList<int> _theList = new LinkedList<int>();
        int[] _theArray = new int[5000000];

        // Use this for initialization
        void Start () {

            MethodTimer.Start("Populate List");
            PopulateList();
            MethodTimer.End("Populate List");

            MethodTimer.Start("Populate Queue");
            PopulateQueue();
            MethodTimer.End("Populate Queue");

            MethodTimer.Start("Populate Array");
            PopulateArray();
            MethodTimer.End("Populate Array");

            MethodTimer.Start("Shift Loop Array");
            _theArray = Shift(_theArray);
            MethodTimer.End("Shift Loop Array");

            MethodTimer.Start("Shift Copy Array");
            _theArray = ShiftCopy(_theArray);
            MethodTimer.End("Shift Copy Array");

            MethodTimer.Start("Shift List");
            ShiftList();
            MethodTimer.End("Shift List");

            MethodTimer.Start("Shift Queue");
            ShiftQueue();
            MethodTimer.End("Shift Queue");
        }

        public void PopulateArray() {
            System.Random r = new System.Random();
            for (int i = 0; i < _theArray.Length; i++) {
                _theArray[i] = r.Next(0, 100);
            }
        }

        public void PopulateQueue() {
            System.Random r = new System.Random();
            for (int i = 0; i < 5000000; i++)
                _theQueue.Enqueue(r.Next(0, 100));
        }

        public void PopulateList() {
            System.Random r = new System.Random();
            for (int i = 0; i < 5000000; i++)
                _theList.AddLast(r.Next(0, 100));
        }

        // Rotate left by copying each element individually.
        public int[] Shift(int[] myArray) {
            int[] tArray = new int[myArray.Length];
            for (int i = 0; i < myArray.Length; i++) {
                if (i < myArray.Length - 1)
                    tArray[i] = myArray[i + 1];
                else
                    tArray[i] = myArray[0];
            }
            return tArray;
        }

        // Rotate left with one bulk Array.Copy plus a single element move.
        public int[] ShiftCopy(int[] myArray) {
            int[] tArray = new int[myArray.Length];
            int v = myArray[0];
            Array.Copy(myArray, 1, tArray, 0, myArray.Length - 1);
            tArray[tArray.Length - 1] = v;
            return tArray;
        }

        public void ShiftQueue() {
            int v = _theQueue.Dequeue();
            _theQueue.Enqueue(v);
        }

        public void ShiftList() {
            int v = _theList.First.Value;
            _theList.RemoveFirst();
            _theList.AddLast(v);
        }
    }
    Using these two classes to measure the time elapsed during the execution of a code block produced some very interesting results. Keep in mind that I am not claiming this is the best way to test something like this; apparently the built-in profiler has similar methods to the ones I've written. You can enclose any code you want between the two methods as illustrated above, and it will return a debug statement telling you how long that code took to execute. Let's move on.

    Results

    The results of my testing with this methodology were quite interesting, and they led me to believe this same type of procedure could be applied to any code; this methodology, though unrefined, is universal.

    Here's a graphic displaying a block of results in the debugger:

    Optimization findings.png

    I took the liberty of sorting these in the script in order of their efficiency. As you can see, the speed with which I was able to populate the various containers differed quite a bit; the array was fastest, Queue<int> was a close second, and LinkedList<int> took the longest to populate.

    Shifting this data was a whole different story. I used two methods for shifting the data in the array: a basic for loop and a copy-and-replace style method (more details can be gleaned from the MonoBehaviour above). I found that the for loop was on average 10x slower than the copy-and-replace method.
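
    Both shift methods should produce identical output; here's a minimal sanity check in plain C# (outside Unity, using a small hypothetical array rather than 5 million elements):

```csharp
using System;

int[] source = { 1, 2, 3, 4, 5 };

// Rotate left one element at a time.
int[] ShiftLoop(int[] a)
{
    int[] t = new int[a.Length];
    for (int i = 0; i < a.Length; i++)
        t[i] = (i < a.Length - 1) ? a[i + 1] : a[0];
    return t;
}

// Rotate left with one bulk Array.Copy plus a single element move.
int[] ShiftCopy(int[] a)
{
    int[] t = new int[a.Length];
    Array.Copy(a, 1, t, 0, a.Length - 1);
    t[t.Length - 1] = a[0];
    return t;
}

int[] byLoop = ShiftLoop(source);
int[] byCopy = ShiftCopy(source);
Console.WriteLine(string.Join(", ", byLoop)); // 2, 3, 4, 5, 1
```

    The speed difference comes from Array.Copy moving the whole block in one call instead of paying the loop and bounds-check overhead per element.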

    When shifting the LinkedList<int> and Queue<int>, things got really interesting. Both of these methods returned an average of less than 1ms; in fact, most of the time they returned 0ms.

    Your results will vary every time you check the elapsed time of your code; this is not an error, it is the expected behaviour. The best thing to do is collect a large number of results and calculate the average, because your data set will contain outliers.
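
    One way to do that averaging automatically is to wrap the timed block and run it several times. This is just a sketch of the idea in plain C#; the run count and the workload are arbitrary placeholders:

```csharp
using System;
using System.Diagnostics;
using System.Linq;

// Time a single run of an action in milliseconds.
double TimeOnce(Action action)
{
    var sw = Stopwatch.StartNew();
    action();
    sw.Stop();
    return sw.Elapsed.TotalMilliseconds;
}

// Collect many samples so a single outlier doesn't skew the verdict.
const int runs = 20;
double[] samples = new double[runs];
for (int i = 0; i < runs; i++)
{
    samples[i] = TimeOnce(() =>
    {
        var a = new int[100000];
        for (int j = 0; j < a.Length; j++) a[j] = j;
    });
}

double average = samples.Average();
Console.WriteLine($"average over {runs} runs: {average:F3}ms");
```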

    Summary

    In closing, I think all developers should be aware of good optimization techniques. This is especially important if you intend to develop applications for mobile platforms. Mobile devices have come a long way in the last few years, but they still pale in comparison to a console and especially to a high end gaming PC.

    The best thing to do is always be aware of optimization. Make yourself knowledgeable about good optimization practices and apply that in your work. Feel free to use the scripts in this thread to test the efficiency of your code; they are completely open source. I may refine the concept further and set up a public Git repository for the project. If anyone has any suggestions for refining my methodology, I would love to hear them, as I intend to use this to optimize blocks of my own code from this point forward. Thanks for taking the time to read this thread, and best of luck to each and every one of you in your future endeavours!
     
    Last edited: May 10, 2015
    leni8ec, ssojyeti2, rakkarage and 3 others like this.
  2. MrDahl

    MrDahl

    Joined:
    Oct 29, 2013
    Posts:
    5
    Very good explanation as well as a nice image to illustrate the result. Thanks a lot for your work.
     
    hamsterbytedev likes this.
  3. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    If this helps some people understand optimization concepts a little better then I feel like I've done something good. I'm sorry we kind of hijacked your thread with a discussion on optimization. On the bright side, I think we can all use this to help us out with our own optimization, so some good has definitely come from it. Feel free to use this as you see fit. It's such a simple idea and I think it should be available to everyone.
     
  4. steego

    steego

    Joined:
    Jul 15, 2010
    Posts:
    969
    Nice writeup, and somewhat expected results. For a Queue, enqueuing and dequeuing are O(1) operations, the same goes for a LinkedList, where Remove/Add First/Last are all O(1) operations.

    Shifting by looping, however, is an O(n) operation, because it has to visit every element.

    Array.Copy is the odd one out; I'm guessing it relies on some form of memcopy internally, so it's also O(n), but it operates on bigger chunks.

    The most surprising result to me is the time it takes to populate the list, I can't see a reason for this taking so much longer.

    It might be an idea to try generating the list and queue from the array.

    Code (csharp):
    Queue<int> q = new Queue<int>(array);
    LinkedList<int> l = new LinkedList<int>(array);
    I just came across http://bigocheatsheet.com; you might find it interesting.
     
    hamsterbytedev likes this.
  5. Fajlworks

    Fajlworks

    Joined:
    Sep 8, 2014
    Posts:
    344
    Thanks for the writeup, those results look interesting. Worst of all, I didn't even know there was a built-in Queue class =.='
     
  6. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    Thanks, I've bookmarked that page as I think it will be quite useful in the future. I went ahead and populated the list and queue from the values in the array. There is no appreciable difference.

    Optimization findings.png
     
  7. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    Now you do, and hopefully you have a better idea about code optimization as well :)
     
  8. Fajlworks

    Fajlworks

    Joined:
    Sep 8, 2014
    Posts:
    344
    I've decided to check performance for something I've wondered about for some time: does using properties impact performance, since they use method calls under the hood?

    I've written a mockup script, something like:
    Code (CSharp):
    using UnityEngine;
    using System.Collections;
    using System.Diagnostics;

    public class CodePerformanceTester : MonoBehaviour
    {
        Stopwatch stopwatch1 = new Stopwatch();
        Stopwatch stopwatch2 = new Stopwatch();
        int testVariable = 0;

        public int TestProperty
        {
            get { return testVariable; }
            set { testVariable = value; }
        }

        // Use this for initialization
        void Start ()
        {
            Invoke("Test", 1f);
        }

        void Test()
        {
            stopwatch1.Start();
            for (int i = 0; i < 1000000; i++)
            {
                testVariable = i;
            }
            stopwatch1.Stop();
            long result1 = stopwatch1.ElapsedMilliseconds;

            stopwatch2.Start();
            for (int i = 0; i < 1000000; i++)
            {
                TestProperty = i;
            }
            stopwatch2.Stop();
            long result2 = stopwatch2.ElapsedMilliseconds;

            UnityEngine.Debug.Log("Set variable time: " + result1 + "ms");
            UnityEngine.Debug.Log("Set property time: " + result2 + "ms");
        }
    }
    And results were somewhat expected:


    Property time varied between 13ms and 19ms when I tested this multiple times. Hope this helps someone further optimise their code! :)
     
    leni8ec, landon912 and hamsterbytedev like this.
  9. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    @Fajlworks Good find! I always suspected this to be the case. A few milliseconds may seem negligible, but if you are not careful that can certainly add up.
     
  10. Zuntatos

    Zuntatos

    Joined:
    Nov 18, 2012
    Posts:
    612
    Your example code shifts the list twice, where it seemingly should be one ShiftList and one ShiftQueue.

    Code (CSharp):
    MethodTimer.Start("Shift List");
    ShiftList();
    MethodTimer.End("Shift List");

    MethodTimer.Start("Shift Queue");
    ShiftList();
    MethodTimer.End("Shift Queue");
     
    hamsterbytedev likes this.
  11. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    #facepalm Complete and total oversight. This has been addressed and the code has been updated to reflect the change. It made no difference in the completion time of the method, however, which means I don't need to change the image :)
     
  12. Kiwasi

    Kiwasi

    Joined:
    Dec 5, 2013
    Posts:
    16,860
    Nice write up. Since we lost the benchmarking article over on Unity Gems, it's worth having something like this available.

    It's worth noting the profiler can do this, using the BeginSample and EndSample methods.

    I'll also include my regular warning about premature optimisation. It's easy to get so caught up in tiny optimisations that the big-picture code becomes unreadable and unmaintainable. Finish the project first. Then do big-picture optimisations. Then drop down to individual lines of hot code.
     
  13. Fajlworks

    Fajlworks

    Joined:
    Sep 8, 2014
    Posts:
    344
    Yeah, they don't say "Premature optimization is the root of all evil" for nothing :)
     
    passerbycmc and Kiwasi like this.
  14. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    So the profiler does have this capability? This is news to me. I'm definitely going to look into that further! Thanks.

    Also, I completely agree with not getting caught up in optimization until later in a project. You don't want to get too nitpicky until you're at that stage in development. I always throw stuff down quick and dirty first.
     
  15. superpig

    superpig

    Drink more water! Unity Technologies

    Joined:
    Jan 16, 2011
    Posts:
    4,660
    ssojyeti2 and Kiwasi like this.
  16. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    Just ran another test and found that there was no appreciable difference in speed whichever way the Queue is constructed:

    Optimization findings.png Optimization findings2.png

    It was a good thought, though. I thought it might improve things a bit as well, but according to my methodology one is as good as the other.

    EDIT

    There was an error in my script. It is actually faster. Keep reading for more details.
     
    Last edited: May 11, 2015
  17. superpig

    superpig

    Drink more water! Unity Technologies

    Joined:
    Jan 16, 2011
    Posts:
    4,660
  18. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    Ahhhhhhh, that's better. I had a little issue in my script. The preallocated queue is indeed running much faster than a queue that is not preallocated.

    There was no appreciable difference before because I was setting both to a new instance of Queue instead of using the existing allocation of the second Queue:

    Optimization findings.png

    Here's the source:

    Code (CSharp):
    using UnityEngine;
    using System;
    using System.Collections.Generic;

    public class Example : MonoBehaviour
    {
        LinkedList<int> _theList = new LinkedList<int> ();
        int[] _theArray = new int[5000000];
        int[] _tempArray = new int[5000000];
        Queue<int> _preAllocatedQueue = new Queue<int>(5000000);
        Queue<int> _unallocatedQueue = new Queue<int>();

        // Use this for initialization
        void Start ()
        {
            MethodTimer.Start ("Populate Array Sorted");
            PopulateArraySorted ();
            MethodTimer.End ("Populate Array Sorted");

            MethodTimer.Start ("Populate Queue");
            PopulateQueue ();
            MethodTimer.End ("Populate Queue");

            MethodTimer.Start ("Populate Preallocated Queue");
            PopulatePreQueue ();
            MethodTimer.End ("Populate Preallocated Queue");
        }

        public void PopulateArrayRandom ()
        {
            System.Random r = new System.Random ();
            for (int i = 0; i < _theArray.Length; i++) {
                _theArray [i] = r.Next (0, 100);
            }
        }

        public void PopulateArraySorted ()
        {
            System.Random r = new System.Random ();
            for (int i = 0; i < _theArray.Length; i++) {
                _theArray [i] = r.Next (0, 100);
            }
        }

        public void PopulateQueue ()
        {
            _unallocatedQueue = new Queue<int> (_theArray);
        }

        public void PopulatePreQueue ()
        {
            for (int i = 0; i < _theArray.Length; i++) {
                _preAllocatedQueue.Enqueue (_theArray [i]);
            }
        }

        public void PopulateList ()
        {
            _theList = new LinkedList<int> (_theArray);
        }

        public int[] Shift (int[] myArray)
        {
            int[] tArray = new int[myArray.Length];
            for (int i = 0; i < myArray.Length; i++) {
                if (i < myArray.Length - 1)
                    tArray [i] = myArray [i + 1];
                else
                    tArray [i] = myArray [0];
            }
            return tArray;
        }

        public int[] ShiftCopy (int[] myArray)
        {
            int[] tArray = new int[myArray.Length];
            int v = myArray [0];
            Array.Copy (myArray, 1, tArray, 0, myArray.Length - 1);
            tArray [tArray.Length - 1] = v;
            return tArray;
        }

        public int[] BubbleSort (int[] myArray)
        {
            int length = myArray.Length;
            int temp = myArray [0];
            for (int i = 0; i < length; i++) {
                for (int j = i + 1; j < length; j++) {
                    if (myArray [i] > myArray [j]) {
                        temp = myArray [i];
                        myArray [i] = myArray [j];
                        myArray [j] = temp;
                    }
                }
            }
            return myArray;
        }

        public int[] InsertionSort (int[] myArray)
        {
            for (int i = 0; i < myArray.Length - 1; i++) {
                for (int j = i + 1; j > 0; j--) {
                    if (myArray [j - 1] > myArray [j]) {
                        int temp = myArray [j - 1];
                        myArray [j - 1] = myArray [j];
                        myArray [j] = temp;
                    }
                }
            }
            return myArray;
        }

        public int[] SelectionSort (int[] myArray)
        {
            int N = myArray.Length;
            for (int i = 0; i < N - 1; i++) {
                int k = MinInArray (myArray, i);
                if (i != k)
                    Exchange (myArray, i, k);
            }
            return myArray;
        }

        public static void Exchange (int[] myArray, int m, int n)
        {
            int temporary = myArray [m];
            myArray [m] = myArray [n];
            myArray [n] = temporary;
        }

        public int MinInArray (int[] myArray, int start)
        {
            int minPos = start;
            for (int pos = start + 1; pos < myArray.Length; pos++)
                if (myArray [pos] < myArray [minPos])
                    minPos = pos;
            return minPos;
        }

        public int FindIndexByLoop (int[] myArray, int value)
        {
            for (int i = 0; i < myArray.Length; i++) {
                if (myArray [i] == value)
                    return i;
            }
            return -1;
        }

        public int BinarySearch (int[] arr, int lowBound, int highBound, int value)
        {
            int mid;
            while (lowBound <= highBound) {
                mid = (lowBound + highBound) / 2;
                if (arr [mid] < value) {
                    lowBound = mid + 1;
                } else if (arr [mid] > value) {
                    highBound = mid - 1;
                } else {
                    return mid;
                }
            }
            return -1; // value not found
        }

        public void ShiftQueue ()
        {
            //int v = _theQueue.Dequeue ();
            //_theQueue.Enqueue (v);
        }

        public void ShiftList ()
        {
            int v = _theList.First.Value;
            _theList.RemoveFirst ();
            _theList.AddLast (v);
        }
    }
    Good catch superpig!
     
    superpig likes this.
  19. landon912

    landon912

    Joined:
    Nov 8, 2011
    Posts:
    1,579
    While pretty off topic, have any of the issues with foreach loops been fixed, or should everyone still hold onto the habit of avoiding them like the Black Plague?

    It's really a pity, as they do increase readability quite considerably; and while we're at it: they're much more type safe.
     
  20. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    I could be wrong, but as far as I know foreach iteration is still slower than a normal for loop. You can certainly test that theory with the classes I've posted above. I'd test it right now to verify, but I'm away from my computer. If you do test it, go ahead and post your results here; we'd like to see them for sure.
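
    For anyone who wants to try it outside Unity, here's a bare-bones comparison using Stopwatch (plain C#; the absolute numbers will vary by machine and runtime, so the only firm claim is that both loops do the same work):

```csharp
using System;
using System.Collections.Generic;
using System.Diagnostics;

var list = new List<int>();
for (int i = 0; i < 1000000; i++) list.Add(i);

// Index-based for loop.
long sumFor = 0;
var swFor = Stopwatch.StartNew();
for (int i = 0; i < list.Count; i++) sumFor += list[i];
swFor.Stop();

// foreach loop; for List<T> this goes through a struct enumerator.
long sumForeach = 0;
var swForeach = Stopwatch.StartNew();
foreach (int v in list) sumForeach += v;
swForeach.Stop();

Console.WriteLine($"for: {swFor.ElapsedMilliseconds}ms, foreach: {swForeach.ElapsedMilliseconds}ms");
```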
     
  21. JamesLeeNZ

    JamesLeeNZ

    Joined:
    Nov 15, 2011
    Posts:
    5,616
    If it's still allocating an enumerator, then yes, they should still be avoided.

    The safer bet is to just avoid them. I'm not sure why you think they're considered more type safe, though?
     
    hamsterbytedev likes this.
  22. Kiwasi

    Kiwasi

    Joined:
    Dec 5, 2013
    Posts:
    16,860
    For arrays and lists I tend to use for loops. They end up being more flexible, and also let you do naughty things like modify a collection as you iterate through it.

    foreach does make sense on collections that a for loop just won't work on, like a dictionary or hashset. It also lets you run code on any enumerator, and this lets you do fun things like coroutines.

    So right loop for the right job.
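
    For example, a Dictionary has no integer index, so foreach is the natural way to walk it (the data here is hypothetical):

```csharp
using System;
using System.Collections.Generic;

// Hypothetical example data; a Dictionary can't be indexed 0..n-1.
var hitPoints = new Dictionary<string, int>
{
    { "goblin", 10 },
    { "orc", 25 },
    { "troll", 40 }
};

int total = 0;
foreach (KeyValuePair<string, int> entry in hitPoints)
    total += entry.Value;

Console.WriteLine(total); // 75
```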
     
    hamsterbytedev likes this.
  23. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    Agreed. If foreach loops didn't have a purpose they would probably just be deprecated and wouldn't exist anymore. They just aren't as efficient as a traditional for loop. I prefer not to use them at all unless I have to iterate over a dataset that can't be accessed by way of an index; like the two examples you gave.
     
  24. Kiwasi

    Kiwasi

    Joined:
    Dec 5, 2013
    Posts:
    16,860
    I still use them a lot, especially during early development. foreach is easier to read and more forgiving of changes to the underlying collection type. I use a lot of custom collections; it's sometimes easier to use a generic foreach than to redo the for loops each time the underlying collection changes.
     
  25. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    I haven't done much with custom collections. I haven't needed to, with all the existing ways to store a collection of data. If I need to store a custom data class, like a class that contains information about an item, I tend to favour arrays, lists, or dictionaries to aggregate a bunch of them and iterate through them that way. Would you say this is bad practice? If so, what kind of benefit does using a custom collection offer? I'm sure I'm not the only person who is curious about this :)
     
  26. Kiwasi

    Kiwasi

    Joined:
    Dec 5, 2013
    Posts:
    16,860
    Custom collections are when I want my data just so. There are plenty of things you can do with the regular collections, but also plenty of things you can't do.

    For example in my infinite terrain generator I've implemented a Map collection. This collection is generic, can be indexed like a 2D array, expands infinitely automatically, does not throw index out of range exceptions, allows for negative indexes, and is O(1) for access operations. You would struggle to find one of the built in collections that does exactly this.

    Most of the time custom collections are simply wrappers for the built in collections. Just like a List is a wrapper for an array. My Map collection wraps a set of four 2D arrays. But I hardly want to keep the logic holding the four arrays together to be interspersed through my code base. With a custom collection I can simply access Map[4,-5] and forget about what is happening underneath.
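
    A much simpler sketch of the same idea (this is not Kiwasi's four-array implementation; it just wraps a dictionary keyed by coordinates) shows the kind of interface such a collection can offer:

```csharp
using System;
using System.Collections.Generic;

// Sparse 2D map sketch: negative indexes allowed, no out-of-range
// exceptions, unset cells read back as a default value.
var cells = new Dictionary<(int x, int y), int>();

void Set(int x, int y, int value) => cells[(x, y)] = value;

// TryGetValue keeps reads O(1) and exception-free.
int Get(int x, int y) => cells.TryGetValue((x, y), out int v) ? v : 0;

Set(4, -5, 7);                    // negative indexes are fine
Console.WriteLine(Get(4, -5));    // 7
Console.WriteLine(Get(999, 999)); // 0 rather than an exception
```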
     
    landon912 and hamsterbytedev like this.
  27. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    Yep, that's a very valid point. I've never had to deal with anything like that, so the topic never really crossed my mind. It's quite an interesting approach though. I'll make a note of it in case I have to use a similar approach to something in the future. Cheers!
     
  28. superpig

    superpig

    Drink more water! Unity Technologies

    Joined:
    Jan 16, 2011
    Posts:
    4,660
    There was never an issue with foreach loops when used with arrays.

    When used with other standard collections, they allocate a small amount of memory (usually about 24 bytes) once the loop has finished running. I don't think it's the terrifying result that people make it out to be - especially in code that doesn't run every single frame, and especially not on desktop platforms - and it's only appropriate to worry about it once profiling shows it to be a problem. It's also easily solved by moving your code to a DLL that you compile in Visual Studio, instead of loose scripts in Unity, so it's the sort of thing you can do at the end of a project to tidy things up.
     
  29. superpig

    superpig

    Drink more water! Unity Technologies

    Joined:
    Jan 16, 2011
    Posts:
    4,660
    Just to add a little more detail:

    When you use a foreach loop with an array, internally it's compiled to use a loop counter variable - i.e. what the CPU ends up doing should be pretty much identical to what it does for a for loop.

    When you use a foreach loop with a standard collection - System.Collections.Generic.List<T> and so on - then an enumerator is created (as a struct, so no GC allocation - the 24 bytes I mentioned comes from an inefficiency in the way we clean up at the end of the loop). Each iteration of the loop, MoveNext() is called. Here is the implementation of MoveNext for a List; you can see it does a bit more work than a basic for-loop would, checking that the list has not been modified while you are iterating over it, that the iterator is still within the bounds of the list, etc. This is unlikely to have a significant performance impact compared to using a for-loop, though... and if it turns out that it does, you can always change to an index-based loop later on easily. That sort of thing is a micro-optimisation, though, and is exactly the kind of peephole optimisation that a compiler should be taking care of for you. (Once we have an up-to-date compiler, anyway...)
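
    That version check is easy to see in isolation: modifying a List<T> inside a foreach makes the very next MoveNext() throw.

```csharp
using System;
using System.Collections.Generic;

var numbers = new List<int> { 1, 2, 3 };
bool threw = false;
try
{
    // List<T>'s enumerator records a version number; Add() bumps it,
    // so the next MoveNext() detects the mismatch and throws.
    foreach (int n in numbers)
        numbers.Add(n);
}
catch (InvalidOperationException)
{
    threw = true;
}
Console.WriteLine(threw); // True
```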

    Also, just a thought regarding analysis of some of this stuff: things like Big-O notation are useful, but you should never lose sight of the actual raw time taken by things. It might be that adding items to data structure A is O(1) while adding items to data structure B is O(n); so it sounds like A is a better choice, right? But O(1) just means 'constant time' - it doesn't say anything about what that time is. It could be that adding an item to data structure A always takes 1ms. Meanwhile, adding an item to data structure B might take n*0.01ms - so while it could become more expensive with a lot of items, if you know that in practice it's going to have fewer than 100, it's a better choice.
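    To put numbers on that hypothetical: using the illustrative costs from the paragraph above (a flat 1 ms per insert for A, n * 0.01 ms for B - made-up figures for illustration, not measurements), B wins until the break-even point at n = 100:

    ```csharp
    using System;

    class BigOConstants
    {
        // Hypothetical per-insert costs: A is O(1) at a flat 1.0 ms;
        // B is O(n) at 0.01 ms per item already stored.
        public static double CostA(int n) => 1.0;
        public static double CostB(int n) => n * 0.01;

        static void Main()
        {
            // Despite its worse Big-O, B is cheaper below the break-even.
            Console.WriteLine(CostB(50) < CostA(50));    // True
            Console.WriteLine(CostB(100) >= CostA(100)); // True (break-even)
        }
    }
    ```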
     
    landon912, hippocoder, Kiwasi and 4 others like this.
  30. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    @superpig is a fountain of useful information. I too noticed discrepancies between the Big-O analysis and real-world results, specifically when sorting an array. Moreover, the for vs foreach argument will rage on until the efficiencies of both loops are equal; to the discriminating optimizer it doesn't really matter that the foreach loop isn't actually that much slower.

    I ran tests against each of these to get an accurate idea of just what the difference was. I populated an array from an array using both loop styles, and an array from a list using both loop styles. On a dataset of 5 million integers there was a ~10ms difference in the array-to-array population and a ~150-200 millisecond difference in the list-to-array population; always in favour of the standard for loop. This may seem negligible, but it's still a reality.
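    Timings like these can be reproduced with a simple Stopwatch harness. Here's a rough sketch of the list-to-array population case (plain C#, runnable outside Unity, and not the exact test classes described in the opening post; absolute numbers will vary with hardware and runtime):

    ```csharp
    using System;
    using System.Collections.Generic;
    using System.Diagnostics;

    class LoopTiming
    {
        static void Main()
        {
            const int N = 5_000_000; // dataset size used in the tests above
            var source = new List<int>(N);
            for (int i = 0; i < N; i++) source.Add(i);
            var dest = new int[N];

            // Index-based for loop: direct element access.
            var sw = Stopwatch.StartNew();
            for (int i = 0; i < source.Count; i++) dest[i] = source[i];
            sw.Stop();
            Console.WriteLine($"for:     {sw.ElapsedMilliseconds} ms");

            // foreach: goes through the list's enumerator instead.
            sw.Restart();
            int j = 0;
            foreach (int value in source) dest[j++] = value;
            sw.Stop();
            Console.WriteLine($"foreach: {sw.ElapsedMilliseconds} ms");
        }
    }
    ```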

    I guess the point I'm trying to make is that while it may seem picky to choose a for loop over a foreach loop, the fact remains that the for loop is slightly faster. Here's a good way to look at it - if someone actually wanted to put 4 Titan X's in their system, even though we all know SLI scaling diminishes more and more with every card you bridge, some would do it just because it would give them a minor increase in performance. The increase would be barely noticeable and it would only come up in certain situations. The kind of person who is willing to spend $1200 on another Titan to get an extra ~10 FPS is the kind of person who is going to flame you for using a foreach loop instead of a for loop.

    When it comes to shaving milliseconds off execution times the for loop always prevails because it does not require the extra instructions a foreach loop does. The difference might be small, and well within a margin that is more than acceptable, but for loops are still faster.

    I would like to see this eliminated by the compiler so we can finally settle the debate, but even when it is you are going to have proponents on both sides of the argument. I found myself in a similar debate last week on another thread about how to properly organize your code and make it more readable; a lot of this is personal preference.

    At the end of the day, if a few milliseconds bother you - it's seriously ~10ms over a 5 million point array of integers - then just use a for loop whenever you can and you don't have to worry about it. If you don't need that level of efficiency then it really doesn't matter which one you use. If you find that foreach is easier for you to read then just use it and don't concern yourself with the consequences - they are so slight you probably won't notice them anyway.

    All that being said, I do a fair bit of development for mobile platforms. Optimization here is incredibly important and I tend to be very anal about it. I have a few edge-case applications where I do a lot of iteration over lists and arrays, and I was able to increase performance by eliminating foreach loops and swapping lists for arrays where possible. Keep in mind that the applications to which I am referring are special-case scenarios where I'm iterating over data at the end of every frame in a coroutine. Your results may vary.
     
    Kiwasi likes this.
  31. JamesLeeNZ

    JamesLeeNZ

    Joined:
    Nov 15, 2011
    Posts:
    5,616
    I just use for straight up these days. My first project used foreach, and it was a massive task going through and finding every one and replacing it with a for loop. Foreach looks nicer, but it only saves a marginal amount of typing.
     
  32. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    That's basically my feeling on the subject too.
     
    JamesLeeNZ likes this.
  33. Deleted User

    Deleted User

    Guest

    Did you post this because of the way I use gameobjects as "folders" :p
    Great thread anyway.
    What about Memory Optimization? You should cover that too. :)
     
    Last edited by a moderator: May 12, 2015
  34. JamesLeeNZ

    JamesLeeNZ

    Joined:
    Nov 15, 2011
    Posts:
    5,616
    This is about memory optimization... not sure how you drew a correlation between folder structure and the OP's tests.
     
  35. Deleted User

    Deleted User

    Guest

    Seems like he is testing the execution time of different methods and iterating over lists vs arrays. That's not memory optimization. Memory optimization is about avoiding a bigger memory footprint than you need. Not sure why you're mentioning folder structure, because I never brought it up. I think you misunderstood my post.
     
  36. JamesLeeNZ

    JamesLeeNZ

    Joined:
    Nov 15, 2011
    Posts:
    5,616
    which are all in memory...

    I don't feel that the type of memory optimization you're talking about has anything to do with code.
     
  37. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    This was not related in any way to how you organize your game objects. Though creating extraneous objects should definitely be avoided for optimization, this thread is actually the result of a different, unrelated conversation about the efficiency of data collections. Since you brought it up, though, I can definitely go into more detail about why adding extraneous game objects is a bad idea and post some data from that as well, if anyone is interested.
     
  38. Deleted User

    Deleted User

    Guest

    Actually it does. Different objects have different memory footprints. A dictionary has a bigger memory footprint than an array. Other examples of memory optimization: loading an xml in memory when you don't need to; sprite batching.

    @Hamsterbyte.LLC I was just joking about that. Yeah, go ahead. :) Although now I've trimmed it down to 1 object for scripts + 1 for graphics + 1 for the collider.
     
    Last edited by a moderator: May 12, 2015
  39. JamesLeeNZ

    JamesLeeNZ

    Joined:
    Nov 15, 2011
    Posts:
    5,616
    which is so trivial it's not even worth discussing... if you're picking an array over a dictionary because of the memory footprint, you're doing it wrong.

    sprite batching has nothing to do with code.
     
    hamsterbytedev and Kiwasi like this.
  40. Kiwasi

    Kiwasi

    Joined:
    Dec 5, 2013
    Posts:
    16,860
    I dunno, could be relevant if you are coding in C++ for a micro processor. ;)
     
  41. JamesLeeNZ

    JamesLeeNZ

    Joined:
    Nov 15, 2011
    Posts:
    5,616
    absolutely!

    It's about picking the right thing for the right job. I wouldn't pick an array if I wanted keyed item access, just like I wouldn't pick a dictionary if I was primarily going to iterate over the collection constantly.
     
  42. Deleted User

    Deleted User

    Guest

    That was just an example. You completely missed the point.

    I never said sprite batching had anything to do with code. Reread my post and tell me where I said it did. He posted about code optimization, and I said he should make a thread about memory optimization.
     
    Last edited by a moderator: May 12, 2015
  43. steego

    steego

    Joined:
    Jul 15, 2010
    Posts:
    969
    I agree. The good thing about knowing some Big-O analysis, is that you can more easily see which of your algorithms are good candidates for optimisation, and which alternatives might be a better substitute.

    That being said, I don't think this mantra has been brought up in this thread yet: When optimising, always profile. Then profile some more.

    Compilers and processors do so many crazy things that it's near impossible to reason about them. In some cases an obviously slower algorithm can outperform an obviously faster one because it just happens to fit in the processor's cache, for example.

    Even processors are sort-of compilers these days, compiling machine code into micro-ops and doing things like branch prediction and out of order execution.

    So don't trust your intuition, profile.
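    As a concrete illustration of that point: a linear scan is O(n) and a hash lookup is O(1), yet for a handful of items the scan often wins because the data sits contiguously in one or two cache lines. Whether it actually does on your machine is exactly what profiling tells you. A sketch (no expected winner claimed - measure it):

    ```csharp
    using System;
    using System.Collections.Generic;
    using System.Diagnostics;

    class ProfileDontGuess
    {
        // Linear scan: O(n), but cache-friendly and branch-predictable.
        public static bool ScanContains(int[] items, int key)
        {
            for (int i = 0; i < items.Length; i++)
                if (items[i] == key) return true;
            return false;
        }

        static void Main()
        {
            int[] small = { 3, 1, 4, 1, 5, 9, 2, 6 };
            var set = new HashSet<int>(small); // O(1) lookups, but hashing overhead

            const int reps = 1_000_000;
            var sw = Stopwatch.StartNew();
            bool a = false;
            for (int r = 0; r < reps; r++) a = ScanContains(small, 6);
            sw.Stop();
            Console.WriteLine($"scan: {sw.ElapsedMilliseconds} ms ({a})");

            sw.Restart();
            bool b = false;
            for (int r = 0; r < reps; r++) b = set.Contains(6);
            sw.Stop();
            Console.WriteLine($"hash: {sw.ElapsedMilliseconds} ms ({b})");
        }
    }
    ```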
     
    Ryiah and Kiwasi like this.
  44. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    @steego your brilliance and willingness to help others is a great asset not only to this thread but to the community as a whole. Your advice is always good advice and it is clear to me that you have both the passion and expansive knowledge that it takes to excel in the software world. I believe I speak for all of us when I say never stop being that person. Cheers!

    On a less related note, I will be testing the memory footprints of various data collections as I find the time to do it. I'm looking at you, @supremegrandruler. I will update the original thread when I have results to show. Be advised that @JamesLeeNZ has provided some useful, albeit scathing, insight on the matter.

    I'd like to keep this thread civil without starting any flame wars. The information contained herein is intended to help everyone, so let's try to be as kind and helpful as possible. The only stupid questions are the ones you don't ask. Instead of saying that something is trivial and not worth discussion, explain why you think it is trivial in a clear and concise manner instead of being dismissive or demeaning. Anyways, thanks for keeping the thread alive guys! This is a very productive conversation for all of us! :)
     
  45. Fajlworks

    Fajlworks

    Joined:
    Sep 8, 2014
    Posts:
    344
    If you're using:
    Code (CSharp):
    Vector3 someVector;

    void Update()
    {
        bool result = someVector.Equals( Vector3.one );
    }
    You might want to change that, since the Equals(object) call boxes the vector and allocates 28 bytes every call.


    Just create a utility class like:
    Code (CSharp):
    public class VectorUtilities
    {
        public static bool Compare(Vector2 vec1, Vector2 vec2)
        {
            return vec1.x == vec2.x &&
                   vec1.y == vec2.y;
        }

        public static bool Compare(Vector3 vec1, Vector3 vec2)
        {
            return vec1.x == vec2.x &&
                   vec1.y == vec2.y &&
                   vec1.z == vec2.z;
        }
    }
    If you do the manual check, you won't allocate those bytes, which can be useful in Update() calls.
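    A standalone version of the same idea (Vec3 here is a hypothetical stand-in struct so the sketch compiles without UnityEngine): a field-by-field comparison never boxes, so nothing is allocated. Note that it deliberately uses exact float equality, mirroring Equals; Unity's own == operator on vectors compares approximately instead.

    ```csharp
    using System;

    // Minimal stand-in for UnityEngine.Vector3, just for this sketch.
    struct Vec3
    {
        public float x, y, z;
        public Vec3(float x, float y, float z) { this.x = x; this.y = y; this.z = z; }
    }

    static class VectorCompareDemo
    {
        // Field-by-field comparison: no boxing, no GC allocation.
        // Exact equality is intentional here (matching Equals semantics).
        public static bool Compare(Vec3 a, Vec3 b)
            => a.x == b.x && a.y == b.y && a.z == b.z;

        static void Main()
        {
            Console.WriteLine(Compare(new Vec3(1, 2, 3), new Vec3(1, 2, 3))); // True
            Console.WriteLine(Compare(new Vec3(1, 2, 3), new Vec3(1, 2, 4))); // False
        }
    }
    ```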
     
    hamsterbytedev likes this.
  46. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353

    I have my own methods for returning the memory allocation of a given dataset; I won't be using the profiler for this either. I want people not only to understand that things are the way they are, but why they are that way, and how they can test the same thing anywhere without the Unity profiler. My approach is one of full disclosure, and though the profiler is quite awesome, it doesn't tell you how it works; it just works and you come to rely on it. Everyone has a cell phone and nobody remembers phone numbers anymore because they can look one up at the touch of a button any time they want.

    Convenience is great, but it also stifles the mind and imagination. I think to be a great coder you have to know why things work, how they work, and not just that they do work.

    That being said, you've certainly made a noteworthy observation. One thing I would add is that GC.GetTotalMemory() returns the number of bytes that are 'thought' to be allocated (the docs describe it as a best available approximation). I'm not sure what the implications of this are, or whether Unity uses it to populate that particular field in the profiler, but I do know that thinking something is not the same as knowing something.
     
  47. Deleted User

    Deleted User

    Guest

    Not really, he didn't say anything I didn't already know: he just misunderstood everything from the beginning:
    - First, he thought the "gameobjects folder" thing was addressed at him when it clearly wasn't, and ofc, ended up having no idea what he was talking about because he wasn't the one the reply was addressed to
    - Second, he attacked an example instead of attacking the argument itself
    - Third, he claimed I said sprite batching had something to do with coding, when it was just an example of memory optimization I gave
    - Fourth, correct me if I'm using the terms incorrectly, but code execution speed has nothing to do with memory
     
    Last edited by a moderator: May 12, 2015
  48. hippocoder

    hippocoder

    Digital Ape

    Joined:
    Apr 11, 2010
    Posts:
    29,723
    Code execution speed actually has a great deal to do with memory. The allocation and deallocation is one of the most costly things that slow down your code execution.
     
    hamsterbytedev and Deleted User like this.
  49. hamsterbytedev

    hamsterbytedev

    Joined:
    Dec 9, 2014
    Posts:
    353
    This is exactly correct. Memory allocation is expensive; this principle is clearly illustrated by the tests I ran between a preallocated queue and one that was not preallocated.
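    The principle is easy to demonstrate with Queue&lt;T&gt;: passing a capacity up front sizes the backing array once, while the default constructor grows (allocates and copies) repeatedly as items arrive. A sketch - not the original test classes, and the exact gap depends on runtime and hardware:

    ```csharp
    using System;
    using System.Collections.Generic;
    using System.Diagnostics;

    class QueuePrealloc
    {
        static void Main()
        {
            const int N = 5_000_000;

            // Preallocated: the backing array is sized once up front.
            var sw = Stopwatch.StartNew();
            var preallocated = new Queue<int>(N);
            for (int i = 0; i < N; i++) preallocated.Enqueue(i);
            sw.Stop();
            Console.WriteLine($"preallocated: {sw.ElapsedMilliseconds} ms");

            // Default: the backing array grows and copies repeatedly as
            // the queue fills - that regrowth is where the extra cost is.
            sw.Restart();
            var growing = new Queue<int>();
            for (int i = 0; i < N; i++) growing.Enqueue(i);
            sw.Stop();
            Console.WriteLine($"default:      {sw.ElapsedMilliseconds} ms");
        }
    }
    ```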

     
  50. JamesLeeNZ

    JamesLeeNZ

    Joined:
    Nov 15, 2011
    Posts:
    5,616
    foreach will never get deprecated as it is very useful. The only reason it should be avoided in unity is because of the GC. In standard desktop apps, that 24 byte enumerator allocation is nothing, even run constantly. In Unity in an Update, its feeding the GC, which is what needs to be avoided.

    Ill just leave this here... because lol

     
    hamsterbytedev likes this.