Sudden lurch in remaining work display

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5845
Credit: 109965275845
RAC: 30705544

The Server Status page has

The Server Status page has been showing around 4 days or so remaining work (I didn't pay attention to the precise figures) but this morning it seems to be above 5 days which suggests that incremental additions to the stock remaining are possibly being made. Perhaps testing of small numbers of new +800 tasks??

Anyone been paying better attention to the figures and perhaps can confirm this?

Cheers,
Gary.

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4273
Credit: 245211351
RAC: 12943

Update: We started to "drain"

Update: We started to "drain" the current S5R3a workunit generator, i.e. it will generate all the workunits below ~799Hz that have not yet been generated, put them into the database and then terminate.

You should see a fast decrease in the "Work remaining", a fast increase in the "Workunits in database" and "Workunits with no canonical result" values on the Server status page, and finally find the "Einstein S5R3 generator" "Not running" any more.

This will allow us to start the new "S5R3b" workunit generator some time tomorrow. Hopefully everything goes smooth enough that people not reading the forums won't even notice that there is a transition. The Apps (Beta-, power- and standard Apps) will stay the same, so no need to do something there.

BM

BM

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4273
Credit: 245211351
RAC: 12943

We started to send out the

We started to send out the first (few hundred) "upper-half" S5R3 tasks for testing. For the curious: The task names end in "S5R3b", and the data files in "S5R3". The Delay Bound ("deadline") has been increased to 18 days.

BM

BM

Richard Johnson
Richard Johnson
Joined: 9 Feb 05
Posts: 5
Credit: 18792
RAC: 0

To respect to the 800 hz,

To respect to the 800 hz, given the creditals of the format for recieving credits for wu's. I had believed that in the interim, computer that runs works under the assumption of half time. Meaning that if the work takes longer the credit should not be any different then if it took a short time. I believe that the wear and tear on the cpu in which its life expectancy deminishes. For the lack of a better word. The cpu dies out due to the excessive work load that it endures through processing data.

Bikeman (Heinz-Bernd Eggenstein)
Bikeman (Heinz-...
Moderator
Joined: 28 Aug 06
Posts: 3522
Credit: 689108548
RAC: 212696

RE: To respect to the 800

Message 77644 in response to message 77643

Quote:
To respect to the 800 hz, given the creditals of the format for recieving credits for wu's. I had believed that in the interim, computer that runs works under the assumption of half time. Meaning that if the work takes longer the credit should not be any different then if it took a short time. I believe that the wear and tear on the cpu in which its life expectancy deminishes. For the lack of a better word. The cpu dies out due to the excessive work load that it endures through processing data.

Hi!

The fact that the old near 800 Hz units ran twice as fast was a special effect, so it was not reflected in the credits for WUs. Now the runtime is back to "normal" and all should be fine.

As to wear and tear of the CPU: I would not be concerned about this, CPUs are designed to run on max load for years and years (if they run within the specified limits, overclocking is a different story, of course). Most components are stressed more when switching the system on and off, so 24/7 operation or a continuous high-load operation should not lower the lifetime of a CPU below the time you usually expect to have a CPU in operation (few of us use a CPU build 10 years ago, and we won't use our current CPUs in 10 years).

There are some moving parts like fans and disk drives that might show a lower life expectancy from BOINC, tho.

If you see it from an economical view, the cost of wear-and-tear should be insignificant anyway compared to the additional energy costs, so wear and tear is a non-issue, I guess.

CU
Bikeman

archae86
archae86
Joined: 6 Dec 05
Posts: 3145
Credit: 7057014931
RAC: 1618005

RE: As to wear and tear of

Message 77645 in response to message 77644

Quote:
As to wear and tear of the CPU: I would not be concerned about this, CPUs are designed to run on max load for years and years (if they run within the specified limits, overclocking is a different story, of course). Most components are stressed more when switching the system on and off, so 24/7 operation or a continuous high-load operation should not lower the lifetime of a CPU below the time you usually expect to have a CPU in operation (few of us use a CPU build 10 years ago, and we won't use our current CPUs in 10 years).


I was a reliability guy for a major semiconductor manufacturer for four years around 1990, and have had some further contact on this subject since. Your advice to users here differs from my understanding of the matter.

Actually, these days the in-service reliability goal is set for the distribution of expected operating conditions, not on the worst-case assumption that all in-service parts see worst-case conditions. If everybody re-wrote their flash card at the maximum feasible rate, many more would fail far sooner than the goals. If everybody operated their CPU at 100% utilization with poor cooling, the fleet CPU failure rate would be much higher than the requirement.

Hotter is worse, and higher voltage is worse, though the degree to which these two things hurt your chances varies with mechanism.

Thermal cycling of a CPU to the degree presented by switching a system off and on is an utterly negligible stress. There have been cases (usually involving thin-film compatibility issues) of component/package combinations with appreciable thermal cycling failure rates stemming from delamination, but the accumulated harm varies as a quite high power of the cycling range (something like sixth power, if I recall a paper my colleague Rich Blish presented on the subject), and the range for desk-top CPUs is just not much.

In summary, yes, you are raising the probability of failure of your CPU at any given moment (including the first minute after start) by running BOINC applications in time which would otherwise be idle. You are raising it further if you increase power consumption and temperature by overclocking. You are raising it further if you raise the CPU voltage. You are raising it further if you operate the PC during hours in which you would have shut it down.

I do agree that there are probably components of the PC which don't like the system being powered up and down, but CPU failure probability is not likely in that category.

So after all that negativism, let me switch sides and say:
1. You are far more likely to have your system fail from fouled-up software than from any hardware problem.
2. Among hardware problems, last time I saw the data, hard drive failures and monitor failures are considerably more common than failures in the CPU/motherboard parts of the system.
3. So if you don't carry overvoltage overclocking to extremes, and assure decent cooling, I don't think your extra risk is troublingly high.

Erik
Erik
Joined: 14 Feb 06
Posts: 2815
Credit: 2645600
RAC: 0

RE: snip 2. Among hardware

Message 77646 in response to message 77645

Quote:

snip
2. Among hardware problems, last time I saw the data, hard drive failures and monitor failures are considerably more common than failures in the CPU/motherboard parts of the system.
/snip

I cannot comment on the monitor bit (I shut mine off between sessions usually) but I can attest to the HDD failures/BOINC issue. I replaced one a few months ago that had been in use for about 4 years (crashed hard) and in another box I have one (2.5 years) that is showing the Spin Retry Count to be out of it's acceptable threshhold (debating whether to do a HD swap between boxes because the oldest one is going to the kids shortly). It is possible that both is a case of "averages" but none of my previous computers burned through their HDDs in their lifetimes (5+ years with no BOINC). IMO, the cost of an equivalent HDD is quite reasonable though lost data if unrecoverable from a crash can be bad thing (repeat mantra "backup data, backup data")

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.