Distributed processing - running the same tests across grid nodes
avishnyakov
#1 Posted : Sunday, July 12, 2015 11:55:48 PM(UTC)
Rank: Member

Hello team,

Is there any option to run a particular test, or set of tests, across a predefined list of grid nodes?
A bit of context:

For instance, we have a set of integration tests which have to pass on WinXP, Win7 and Win8 (and at least N other possible configurations besides).
So we've got 3 VMs (WinXP, Win7, Win8) as NCrunch servers, and we are able to run our tests on a particular node.

That's where the inconvenience comes in.
Yes, we can run tests against a node, but we can't make sure that the tests are fine on every VM.
So far we have to run all tests against node 1, then node 2, then node 3, which is a bit pointless and, with hundreds of tests, simply useless.
Or we have to double/triple the grid nodes (two VMs with XP, two VMs with Win8, etc.) to parallelise and speed up execution time.

This is the case with our integration tests: the full test suite takes between 60 and 90 minutes to complete against one VM/environment.
With 3 VMs (WinXP, Win7, Win8) it simply triples, requiring 3+ hours to complete.

NCrunch seems close to solving that issue; at least it allows us to run tests remotely against various environments. That IS awesome!
The rest of the testing is still highly manual: run against that node, wait, run against this one, wait, run against that one.

What would be nice to have is some kind of playlist: a set of tests which can be associated with one or many grid nodes and counts as passing only if it passes on every single node.

Are we missing something in the configuration?
Can NCrunch help with this?
Remco
#2 Posted : Monday, July 13, 2015 12:55:38 AM(UTC)
Rank: NCrunch Developer

Hi, thanks for posting!

If I understand what you're asking for, the ideal feature to solve this would be one where the NCrunch engine is able to duplicate a test multiple times inside the queue, with each instance being applied to a different grid node.

I've actually considered adding this feature for some time - there is a reason it doesn't yet exist :(

When tests are run in different environments, they can behave differently. If you have a test that is run under XP, then under Win8, the physical path of execution may be different between these runs. This can create some confusing results in the NCrunch UI, which is designed to only show one result per test (the most recent one). This could be problematic when one of the tests fails, but the others pass. To display the results meaningfully, NCrunch would need to somehow correlate the execution runs and produce an aggregate report tied to the same test.

The situation could also become more confusing when considering the reliability of the network. It's possible that one of the VMs may be offline at the time the NCrunch tests are run. If NCrunch is to duplicate the tests across all connected nodes, an offline node would simply be a test case that is silently ignored. This could create a scenario where tests silently 'pass' without actually being run.

When examining the wiring behind how such a feature would need to work, the ideal solution becomes clear: These are, in fact, different tests. Each test should have its own code coverage and its own result - restricted to run on a specific node.

While it's a bit verbose, the following code structure would allow you to achieve this:

// Assuming an NUnit-style [Test] attribute and the RequiresCapability
// attribute from NCrunch.Framework.
using NCrunch.Framework;
using NUnit.Framework;

public abstract class MyFixture
{
    [Test]
    public void TestThatWidgetWorksCorrectlyOnThisPlatform()
    {
        // Insert test code here to run inside VMs across platforms
    }

    [Test]
    public void TestThatOtherWidgetWorksCorrectlyOnThisPlatform()
    {
        // Insert test code here to run inside VMs across platforms
    }
}

[RequiresCapability("WinXP")]
public class MyFixture_UnderWinXP : MyFixture {}

[RequiresCapability("Win7")]
public class MyFixture_UnderWin7 : MyFixture {}

[RequiresCapability("Win8")]
public class MyFixture_UnderWin8 : MyFixture {}


... The grid nodes (VMs) would then be configured to each have a 'capability' flag (a global setting) that includes their O/S. In this way, you'll have a new test for each platform, each with its own result. An advantage of this approach is that it scales well where many systems are involved. For example, you may have two Windows XP VMs that are each able to run the WinXP tests. These VMs would then be used efficiently by the processing queue to maximise parallel execution and minimise unnecessary processing.

A serious drawback of this approach is the need to have an extra descendant class for each platform. This is a constraint tied very much to the nature of the test frameworks themselves, where multi-dimensional generation of tests is still limited to specific scopes (such as fixtures).

I think an ideal approach would be to somehow generate the node-specific tests. What does your test suite look like at the moment?
avishnyakov
#3 Posted : Monday, July 13, 2015 2:01:25 AM(UTC)
Rank: Member

Ouch, thanks for the fast reply.
That's gonna be a big one, so have a coffee :)

>>If I understand what you're asking for, the ideal feature to solve this would be one where the NCrunch engine is able to duplicate a test multiple times inside the queue, with each instance being applied to a different grid node.
Yes, indeed!

>> When tests are run in different environments, they can behave differently.
Indeed they can.
It's the developer's responsibility to make sure that the codebase works well across environments, and the same goes for the regression tests - check your environment if you really need to set something up.

The codebase of the software is the same, the test is the same, and the expected behaviour is the same.
Rarely, there is some stuff which is designed and compiled to run only on XP/x64, but that's not a shared codebase meant to run on various environments. For a shared codebase there is only one test shared across all environments.

>>This can create some confusing results in the NCrunch UI, which is designed to only show one result per test (the most recent one)
Too bad for NCrunch :)
There is room for improvement! Woohoo!

>> This could be problematic when one of the tests fails, but the others pass.
Expected behaviour - that's exactly what we are looking for.
A shared codebase is meant to work on every single environment, so we are extremely keen to see something like "Passed on 2 environments out of 3" in a nice-looking grid view (or similar) for failed tests, and "Passed 3 out of 3" for passed tests. Absolutely awesome!

Failures mean that some code does not work and needs to be fixed, so we can act accordingly.

>>To display the results meaningfully, NCrunch would need to somehow correlate the execution runs and produce an aggregate report tied to the same test.
Where and how much do we pay for that? :)
That is true: the "Test outcome" column could be either a simple one storing a single value, or a complex one that is green if all grid nodes are green and red if one of the nodes fails. Aaaaa... how much we want this! Can we write a plugin or something? :)

>>The situation could also become more confusing when considering the reliability of the network. It's possible that one of the VMs may be offline at the time the NCrunch tests are run.

That's fine. That's a grid node thing, not the test.
So, once connectivity is back, the grid node is back, and then it might be reasonable to update the results, isn't it?
Refresh/rerun the test against all nodes - people can live with that.

>> If NCrunch is to duplicate the tests across all connected nodes, an offline node would simply be a test case that is silently ignored.
Ignored != Passed, so what's the issue?
2 out of 3 passed, one is ignored, so the total across 3 VMs can't be green. That's fine.

>> the ideal solution becomes clear: These are, in fact, different tests.
Different from the developer's perspective? Not too sure.

We have 700+ regression tests which are run under 3+ environments right now.
How? Manually. We have 3+ VMs with VS; we get the latest, compile, run the tests, fix, and repeat.
Then RDP to the next VM - same again.

Now, having these "RequiresCapability" attributes (or any other kind of attrs) and (oh boy) inheritance is supposed to win the day.
700+ tests across 10+ environments with RequiresCapability and inheritance? Really?

Having said that, these tests are just about functionality, only to guard against regressions.
We expect to add "performance tests" to find out what works fast/slow and on which environments, plus some tests for the concurrency/multithreading implementation. That seems to raise the bar up to 1000 tests by the end of the year, all of which have to be run against several environments/VMs.

Now, we surely can use T4/ReSharper/PS to generate all the tests for all the environments with RequiresCapability and the other stuff.

Is that the solution we need to go with?
Btw, Visual Studio would go crazy with [RequiresCapability("WinXP")] and inheritance, wouldn't it? It'd see x2/x3/x4 tests, then.


>> In this way, you'll have a new test for each platform, each with its own result. An advantage of this approach is that it scales well where many systems are involved. For example, you may have two Windows XP VMs that are each able to run the WinXP tests. These VMs would then be used efficiently by the processing queue to maximise parallel execution and minimise unnecessary processing.

That's the other thing regarding grid processing, you're right.
Right now it is unclear how to set this up properly: all the tests across several environments, plus splitting them up within each environment.

Say, 600 tests need to be run on XP/Win7/Win8.
We have 9 VMs - 3 with XP, 3 with Win7 and 3 with Win8.

Now, we surely would be happy to split the 600 into 2 batches across two VMs, or later on across 3-4-6 VMs.

How'd we do that?

>>I think an ideal approach would be to somehow generate the node-specific tests. What does your test suite look like at the moment?

We have a mission-critical software library; it comes in .NET 4.5/4.0/3.5.
520+ regression tests, written once and designed to work on every supported environment.
We have 4 environments to support so far.
Pretty much, we take these 4 environments, get the latest, run the tests and see what works and what doesn't. Fix the issue, repeat.
It takes between 60 and 90 minutes per environment to run the whole test suite.

Now, we started exploring NCrunch and grid processing due to:
1) +2 new upcoming environments a few months from now, with more possibly coming later (these freaking enterprise software updates/CUs/service packs)
2) Starting a "refactoring" phase - refactoring the core breaks a lot, so we run tests more intensively
3) Adding two new "big" features and expecting more tests to be added - guess 200-300 in a few months
4) "Hardening" the already existing tests - checking more things, so execution time expands

Again, all this is regression testing; there are a few unit tests which take NOTHING to execute.
The regression tests are designed to make sure that the library works on particular enterprise software, supports particular cases and does not break production. It's mission-critical, so we don't fake/mock anything at all. We run directly against the real enterprise software, handling all the bugs/issues/inconveniences and other things related to its service packs, updates and so on.

Sorry for the big messages here and there - just wanted to provide more context around this.
Hope it gives more visibility into our little issue. We might not have seen all the features of NCrunch, so we're absolutely open to ideas, thanks!

And still, excited to see NCrunch in action. Terrific thing, truly!
avishnyakov
#4 Posted : Monday, July 13, 2015 2:13:00 AM(UTC)
Rank: Member

But yes, for the time being we can use T4/PowerShell or something to pre-generate node-based tests with RequiresCapability.
That would work well; we put in a few #ifdefs or something - I can see how it would work.
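
For the record, here's roughly what we have in mind - just a sketch, not working code yet. The tool name, platform list and paths are made up by us, and it assumes the abstract-base fixture layout from your example, with the RequiresCapability attribute coming from NCrunch.Framework:

using System.IO;
using System.Linq;
using System.Reflection;
using System.Text;

// Sketch of a pre-build generator: reflect over the compiled test assembly,
// emit one RequiresCapability-attributed descendant per abstract fixture per
// platform, and write them to a .cs file compiled into a separate assembly.
class GenerateNodeFixtures
{
    static void Main(string[] args)
    {
        var testAssembly = Assembly.LoadFrom(args[0]);      // e.g. Tests.dll
        var platforms = new[] { "WinXP", "Win7", "Win8" };

        var source = new StringBuilder();
        source.AppendLine("using NCrunch.Framework;");

        // Abstract public classes are taken to be the shared fixture bases.
        foreach (var fixture in testAssembly.GetTypes()
            .Where(t => t.IsClass && t.IsPublic && t.IsAbstract && !t.IsSealed))
        {
            foreach (var platform in platforms)
            {
                source.AppendLine(string.Format(
                    "[RequiresCapability(\"{0}\")] public class {1}_Under{2} : {3} {{ }}",
                    platform, fixture.Name, platform, fixture.FullName));
            }
        }

        File.WriteAllText(args[1], source.ToString());     // e.g. NodeFixtures.generated.cs
    }
}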

Having said that, there is one thing I forgot to mention.
There are two legacy environments where the library gets compiled against .NET 3.5.
Frankly speaking, we have never run tests against those environments, assuming that all would be good since the tests are fine with .NET 4.0/4.5 on Win7/8.
MSBuild is used to get things done.

Looking forward, with such great potential in NCrunch we'd be keen to add regression coverage on these old environments as well, but we're not sure how to configure a particular node to build the lib in a particular way.
We saw settings at the solution-wide level (x86/x64, etc.), but did not see the same settings at the grid node level.

Is it possible to get this done with NCrunch?
That is, having a shared codebase which gets copied to the grid node, gets compiled as .NET 4.5/4.0 for the Win7/8 nodes and as .NET 3.5 for the WinXP node, and then tested?
Remco
#5 Posted : Monday, July 13, 2015 4:44:57 AM(UTC)
Rank: NCrunch Developer

It's interesting how similar your setup is to that of the NCrunch codebase. NCrunch also uses a range of VMs to run tests over a range of different platforms. It compiles to .NET 3.5 too :)

I wish I could give you a simple attribute or feature that would allow the grid to just distribute the tests in the way that would be ideal for you, but I'm afraid it isn't as simple as it seems. It's true that much of the infrastructure required for this already exists (the transfer of source code, remote test execution, etc). But unfortunately, the handling and processing of results from the distributed processing is designed more for horizontally scaling the engine, rather than extensive cross-platform testing. That's not to say that cross-platform testing using distributed processing is impossible - it's just harder.

Right now NCrunch is designed under the assumption that we only have one true result for every test. As soon as we introduce a different dimension of test result (i.e. grid node), much of the client-side internals of NCrunch would need to be redesigned to make this meaningful. This would require a tremendous amount of work to achieve and at this stage I haven't thought through enough of the edge cases to know whether it's possible to arrive at a clean solution.

By far, the cleanest approach to cross-platform testing (from the side of NCrunch) would be to logically split the tests through auto-generation or some sort of abstraction. At the moment, I know of no tool that will do this without some level of effort required. The test frameworks work against us here, as NCrunch is reliant on test frameworks to physically discover the tests. This means we can't just duplicate the tests at discovery time inside NCrunch, as this blows away the abstractions built into each test framework.

... This leads us to the one solution that will work in a reliable way: manipulating the output assembly in a post-build event to add descendants to each test fixture, with the appropriate RequiresCapability attributes applied. This could be done using Mono.Cecil, or perhaps PostSharp (I am no expert on this tool).
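
To make the Cecil option a little more concrete, here's a rough sketch only - assuming a recent Mono.Cecil, the abstract-base fixture layout from my earlier example, and that the rewriter references NCrunch.Framework so the attribute can be resolved. None of this is a built-in NCrunch feature:

using System.Linq;
using Mono.Cecil;
using Mono.Cecil.Cil;

// Post-build sketch: open the compiled test assembly and inject one
// RequiresCapability-attributed descendant per abstract fixture per platform.
class FixtureRewriter
{
    static void Main(string[] args)
    {
        var module = ModuleDefinition.ReadModule(
            args[0], new ReaderParameters { ReadWrite = true });

        // Constructor of RequiresCapabilityAttribute(string), resolved via a
        // reference to NCrunch.Framework in the rewriter project.
        var attributeCtor = module.ImportReference(
            typeof(NCrunch.Framework.RequiresCapabilityAttribute)
                .GetConstructor(new[] { typeof(string) }));

        foreach (var fixture in module.Types
            .Where(t => t.IsClass && t.IsPublic && t.IsAbstract && !t.IsSealed)
            .ToList())
        {
            var baseCtor = fixture.Methods.First(
                m => m.IsConstructor && !m.IsStatic && !m.HasParameters);

            foreach (var platform in new[] { "WinXP", "Win7", "Win8" })
            {
                var descendant = new TypeDefinition(
                    fixture.Namespace, fixture.Name + "_Under" + platform,
                    TypeAttributes.Public | TypeAttributes.Class, fixture);

                // Apply [RequiresCapability("<platform>")] to the new type.
                var attribute = new CustomAttribute(attributeCtor);
                attribute.ConstructorArguments.Add(
                    new CustomAttributeArgument(module.TypeSystem.String, platform));
                descendant.CustomAttributes.Add(attribute);

                // Public parameterless constructor that just calls the base
                // constructor, so the test framework can instantiate the type.
                var ctor = new MethodDefinition(".ctor",
                    MethodAttributes.Public | MethodAttributes.HideBySig |
                    MethodAttributes.SpecialName | MethodAttributes.RTSpecialName,
                    module.TypeSystem.Void);
                var il = ctor.Body.GetILProcessor();
                il.Emit(OpCodes.Ldarg_0);
                il.Emit(OpCodes.Call, module.ImportReference(baseCtor));
                il.Emit(OpCodes.Ret);
                descendant.Methods.Add(ctor);

                module.Types.Add(descendant);
            }
        }

        module.Write();  // writes back over the file opened with ReadWrite
    }
}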
avishnyakov
#6 Posted : Monday, July 13, 2015 5:04:56 AM(UTC)
Rank: Member

Sure, thanks for the answers, appreciate that.

Well, originally this is SharePoint-related stuff. Just change WinXP/Win7/Win8 to different versions of SharePoint - same thing.

>>By far, the cleanest approach to cross-platform testing (from the side of NCrunch) would be to logically split the tests through auto-generation or some sort of abstraction.
Right, we've already started looking into that.
We don't want to push one more dependency in with the attributes. Most likely we're going to generate a separate assembly with T4/PS/PostSharp and push that to NCrunch.
That will do, plus it gives us the ability to keep the original test suites in Visual Studio. Not bad, actually. We like that separation.

>>At the moment, I know of no tool that will do this without some level of effort required.
Welcome to the blue ocean!

>>This means we can't just duplicate the tests at discovery time inside NCrunch, as this blows away the abstractions built into each test framework.
Understandable.

>>This could be done using Mono.Cecil, or perhaps PostSharp (I am no expert on this tool).
Yes, yes, yes, yes.

Awesome summary - many thanks for the support!
Remco
#7 Posted : Monday, July 13, 2015 6:31:31 AM(UTC)
Rank: NCrunch Developer

Thanks for your understanding on this one :)

It may take a little tinkering to get this to work in the post build event. You can use an NCrunch-specific override in your build to restrict the behaviour so it only appears within NCrunch (see http://www.ncrunch.net/documentation/troubleshooting_ncrunch-specific-overrides for more information on this).
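
For example, a guard along these lines in the test project file would keep the rewriting step NCrunch-only (FixtureRewriter.exe stands in for whatever rewriting tool you end up building; the $(NCrunch) property is the override described on that page):

<Target Name="AfterBuild" Condition=" '$(NCrunch)' == '1' ">
  <!-- Inject the platform-specific fixture descendants only for NCrunch builds -->
  <Exec Command="FixtureRewriter.exe &quot;$(TargetPath)&quot;" />
</Target>

This would also sidestep your earlier concern about Visual Studio seeing the multiplied tests, since the descendants would only exist in NCrunch's builds.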

The value of this is astronomical, as it will allow you to cleanly test your application in real time across a wide range of different platforms, with the ability to easily scale upwards with more servers as the size of your test suite increases. The ability to show inline coverage for individual platforms would also give you some ability to debug your code on remote systems. I'm excited to hear how you go with it, and I think that others will be very interested too!