Windows Beta Test App 4.23 available

Brian Silvers

Joined: 26 Aug 05

Posts: 772

Credit: 282700

RAC: 0

RE: In my eyes it's a

21 Jun 2007 17:32:29 UTC

Message 68373 in response to message 68372

(moderation:

)

Quote:

In my eyes it's a matter of inter-project etiquette to try to align the credits or else credit-inflation would happen in a race to attract more users. If intra-parity cannot be achieved, it would be reasonable to pick the most widely used platform for calibration instead (you have to calibrate the cobblestones to some value, after all). At the moment this would rather be Intel/Win than AMD/Win.

But we are getting off-topic a bit I'm afraid.

As I said before, I'll respectfully disagree about it being off-topic.

Beyond the credit disparity, there is really the more troubling performance/watt penalty. A comparable system to mine running Linux will use around 20% less power for the same work with patched 4.17/4.23, and 40-50% if I were using the non-patched 4.17 app.

As for if the Win/Intel platform is selected for the next round of credit dropping, I won't fuss too terribly, provided that the severe penalty that exists now that directly impacts me is addressed to my satisfaction (within 5% of a comparable Linux system)...

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4312

Credit: 250193236

RAC: 35038

RE: Did you get enough data

21 Jun 2007 19:39:07 UTC

Message 68374 in response to message 68369

(moderation:

)

Quote:

Did you get enough data to find out if checkpointing has improved?

I know that it has improved, but the situations affected are actually very rare anyway. May add up to 1% of the errors related to checkpointing.

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4312

Credit: 250193236

RAC: 35038

RE: Is it too early to ask

21 Jun 2007 19:45:12 UTC

Message 68375 in response to message 68370

(moderation:

)

Quote:

Is it too early to ask whether some of the results computed so far will have to be recomputed? Sounds like a rather serious problem affecting all platforms.

All platforms are affected, but not all workunits. People haven't reached a consensus yet how many this actually are. My current wild guess would be of an order of a few percent, but the main question is how reliable we can identify the ones affected without completely re-calculating them all.

Brian Silvers

Joined: 26 Aug 05

Posts: 772

Credit: 282700

RAC: 0

RE: RE: Is it too early

21 Jun 2007 20:42:01 UTC

Message 68376 in response to message 68375

(moderation:

)

Quote:

Quote:
Is it too early to ask whether some of the results computed so far will have to be recomputed? Sounds like a rather serious problem affecting all platforms.

All platforms are affected, but not all workunits. People haven't reached a consensus yet how many this actually are. My current wild guess would be of an order of a few percent, but the main question is how reliable we can identify the ones affected without completely re-calculating them all.

BM

There seems to be some thought that Homogenous Redundancy should be turned on. Is it fair to say that this would only mask "the problem" (credit not being granted) from mainly our perspective as volunteers and still leave an actual scientific problem from the science/project side, or would turning on HR work?

Thanks...

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4312

Credit: 250193236

RAC: 35038

RE: There seems to be some

21 Jun 2007 21:03:01 UTC

Message 68377 in response to message 68376

(moderation:

)

Quote:

There seems to be some thought that Homogenous Redundancy should be turned on. Is it fair to say that this would only mask "the problem" (credit not being granted) from mainly our perspective as volunteers and still leave an actual scientific problem from the science/project side

Yes.

The validation "problem" is only a symptom of an actual "scientific" problem.

The problem was technically a variable that was used uninitialized, but only in some cases. The value of such variables often ends up being zero on Unix-type systems (such as Linux an MacOS, depends also on the compiler and optimization), and some random value on Windows. Unfortunately even zero isn't a valid value from the meaning of this variable, so in these cases actually all results are scientifically invalid.

Brian Silvers

Joined: 26 Aug 05

Posts: 772

Credit: 282700

RAC: 0

RE: RE: There seems to be

21 Jun 2007 21:34:23 UTC

Message 68378 in response to message 68377

(moderation:

)

Quote:

Quote:
There seems to be some thought that Homogenous Redundancy should be turned on. Is it fair to say that this would only mask "the problem" (credit not being granted) from mainly our perspective as volunteers and still leave an actual scientific problem from the science/project side

Yes.

The validation "problem" is only a symptom of an actual "scientific" problem.

The problem was technically a variable that was used uninitialized, but only in some cases. The value of such variables often ends up being zero on Unix-type systems (such as Linux an MacOS, depends also on the compiler and optimization), and some random value on Windows. Unfortunately even zero isn't a valid value from the meaning of this variable, so in these cases actually all results are scientifically invalid.

BM

Did you see my comment about using /RTC if you are using VC++ 2005? That will add run-time checks that can catch uninitialized variables. One other thing it helps with are array OOB conditions (buffer overflow)...

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4312

Credit: 250193236

RAC: 35038

RE: Did you see my comment

21 Jun 2007 21:49:05 UTC

Message 68379 in response to message 68378

(moderation:

)

Quote:

Did you see my comment about using /RTC if you are using VC++ 2005?

Yes, I did. Actually that's how I found the bug (though we are still using VS ".NET" of 2003), but it was necessary for me, too, to get the Runtime Debugger working for this (and to dig out the right workunits).

Your posts are incredibly helpful, thanks a lot! Are you doing Windows programming for living?

Brian Silvers

Joined: 26 Aug 05

Posts: 772

Credit: 282700

RAC: 0

RE: Are you doing Windows

21 Jun 2007 22:06:34 UTC

Message 68380 in response to message 68379

(moderation:

)

Quote:

Are you doing Windows programming for living?

I'm currently unemployed, and I think that there are likely many others who provided much better help... My only background with C++ was maintaining (not initial development) of a credit card communication DLL that was migrated from C, to C++ (Unix), and then to a Windows DLL. A guy I worked with that is much smarter than I could ever hope to be ended up having to add in TAPI support to it, and so we switched to Debug from Release at that point while he and I worked together to test the change. We ended up shipping the DLL still compiled as Debug with /RTC and wrote output to a text file that we periodically checked for any other issues. We had several overflows and uninitialized variables.

Edit: The overflows and uninitialized variables were in parts of the DLL that used TCP/IP rather than async dialup. We had to get the TAPI functionality out the door as fast as we could, but we had heard / seen instances of the auth crashing or hanging on the TCP/IP side, so we left the checks in and found what was giving us grief. Of course, that had management in knots for a while with the concept of "debug" code running in a live environment... We convinced them that there wasn't a real performance impact, and we hadn't changed anything (yet), so if the auth crashed, it would've crashed anyway.

Brian

Brian Silvers

Joined: 26 Aug 05

Posts: 772

Credit: 282700

RAC: 0

In case anyone notices, my

22 Jun 2007 16:29:35 UTC

Message 68381

(moderation:

)

In case anyone notices, my last two results with 4.23 were faster than the first two. There is a reason for it...the same change as to 4.17 ;-) This indicates that the code is either still called periodically or that path was slowed by something in 4.23, but is still faster overall... I don't have a profiler installed, so it was just a guess that it would cause a change in performance. That was my "interesting" thing I was going to try...

Brian...being "naughty" ;-)

Donald A. Tevault

Joined: 17 Feb 06

Posts: 439

Credit: 73516529

RAC: 0

RE: RE: There seems to be

23 Jun 2007 1:12:57 UTC

Message 68382 in response to message 68377

(moderation:

)

Quote:

Quote:
There seems to be some thought that Homogenous Redundancy should be turned on. Is it fair to say that this would only mask "the problem" (credit not being granted) from mainly our perspective as volunteers and still leave an actual scientific problem from the science/project side

Yes.

The validation "problem" is only a symptom of an actual "scientific" problem.

The problem was technically a variable that was used uninitialized, but only in some cases. The value of such variables often ends up being zero on Unix-type systems (such as Linux an MacOS, depends also on the compiler and optimization), and some random value on Windows. Unfortunately even zero isn't a valid value from the meaning of this variable, so in these cases actually all results are scientifically invalid.

BM

Here's something that I just thought of. . .

While this bug still exists, would it be a good idea to not send out more work units? I know that some folks would complain, but it seems to me that it would lessen the chance of getting bad results.

Windows Beta Test App 4.23 available

Forums › Cruncher's Corner

Comment viewing options

Forums › Cruncher's Corner