Report deadline: 1 week!

Thierry Van Driessche
Thierry Van Dri...
Joined: 9 Feb 05
Posts: 210
Credit: 229929
RAC: 0

> I have something similar

Message 3378 in response to message 3377

> I have something similar myself. I don't pretend to know the exact details
> behind it but I can see from my logs that I have uploaded and downloaded
> several workunits today from the server. What I can see is that, like you, I
> can see two 'work units' in the project folder. They are H1_0367.9 and
> H1_0344.4. All of the WUs I have uploaded and downloaded today start with
> these 'IDs' (actually they are all H1_0344.4__**)....
>
>
> Perhaps someone with a more detailed knowledge is reading these posts and can
> explain it to us?

Somewhere in another threat I saw a response to that: all the WU's we have are coming from one big file. The number of the files downloaded are depending from the resource share setting and the setting of "connect to". All the WU's are extracted from these files. Once a file like that is completely "used", this file is replaced by a new one.

Anybody corrects me if this is not the case.

Greetings from Belgium
Thierry

Ziran
Ziran
Joined: 26 Nov 04
Posts: 194
Credit: 605124
RAC: 686

> > Are you considering to

Message 3379 in response to message 3347

> > Are you considering to give out work to computers grouped bye average
> > turnaround time?
>
> Yes -- in fact I made some modifications to the scheduler to do this. But
> then David Anderson (chief BOINC architect) persuaded me that what I had done
> was not a good idea. So I backed it out. I'll probably come back to it in
> the future.

What i was thinking of was to try to find a way to divide all computers in to two gropes that would crunch an equal amount of WU / time period, with all computers with the faster turnaround times in one grope and the slower in the other. To do this i would try to find out what the mean turnaround time for all the WU’s returned say in the last 7 days was, lets call this value T. Now then a computer would need WU’s from a new data set i would let it download data set A if it had a faster turnaround time then T and data set B if it was slower. If it were a new computer (turnaround time 0.00) i would give it the data set B. In my mind this wouldn’t put to much stress on the servers since it would only be used every time an computer would need a new data set and the value T would only be needed to be calculated say once a day.

Then you're really interested in a subject, there is no way to avoid it. You have to read the Manual.

Ziran
Ziran
Joined: 26 Nov 04
Posts: 194
Credit: 605124
RAC: 686

> > Then the scheduler have

Message 3380 in response to message 3347

> > Then the scheduler have to resend a WU, is it sent to the first
> available
> > computer or do you have preferences for who gets whose?
>
> During 3.5 days after creation, the WU goes to the first machine which has the
> right data file, that needs work. During the next 3.5 days, it also has some
> probability of being picked up by a machine that has NO data files. Finally,
> after 7 days, it goes to the first machine that requests work, which has a
> broadband (>100 kB/sec) network connection.

So if i would attach a new computer i would only get a resent WU as a first WU if the scheduler had been trying to send it out for at least 3.5 days. The reason i asked was because then i attached to SETI a couple of months ago the first WU i got was i WU that was resent out. Somehow this didn’t feel right, because i think the probability that something goes wrong is greater with the first WU downloaded.

Then you're really interested in a subject, there is no way to avoid it. You have to read the Manual.

Martin P.
Martin P.
Joined: 17 Feb 05
Posts: 162
Credit: 40156217
RAC: 0

> Here's a link to a E@H

Message 3381 in response to message 3368

> Here's a link to a E@H participant's CPU that is a perfect example of why I
> think the deadlines should be shorter, not longer. What’s the deal
> with this?
>
> Case
> study for shorter deadlines!

>

This is a perfect example why deadlines should be longer! The same will happen to the WUs I downloaded. I attached to the project with SETI@Home Preferences set to connect every 2 days. E@H downloaded 17 WUs. Running my computer 10 hours a day and doing some work beside E@H I can return approximately 2-3 WUs/day. Crunching Monday through Friday makes this 10-15 WUs (in Europe electricity costs money!). Therefore 2-7 WUs will time-out. When I go on holidays for 1 week all WUs will time-out. Is that the idea?

Captain Avatar
Captain Avatar
Joined: 19 Jan 05
Posts: 168
Credit: 321252
RAC: 0

Martin, the settings should

Message 3382 in response to message 3381

Martin, the settings should be at connect every .05 days
and manually connect for more WU's.

The example listed returned two or three units compaired to the dozens not returned.

It's obvious that this unit is for work or other duties and then the unit is shut down when not in use.

This should be a case study on what not to do.......

Tim

If you need an Avatar, I'll create one for you!
Captain.Avatar-[at]-Gmail.com

adrianxw
adrianxw
Joined: 21 Feb 05
Posts: 242
Credit: 322654862
RAC: 0

I joined E@H today. It seems

I joined E@H today. It seems to have downloaded just a single wu.

I think BOINC is missing several things right now, 2 I'd like to see soon are the old "stop_after_send.txt" mechanism whereby a participant can stop receiving units if desired, without detaching from the project and losing the current wu. Sometimes, I want to concentrate on a particular project for my own reasons, normally simple competitive issues.

Another nice feature would be to have some manual control over the scheduling - it may help the situation being described here. If I can see that a wu is getting close to it's deadline, I could flag CPDN/S@H/P@H/LHC as "disabled" at the client level. I realise I can go to the websites and adjust all the percentages, but that is a pain if like me, you participate in all the projects. Having this does not solve the problem above, since suspending a project at the client level would simply risk running it past it's deadline which is of no use to anyone.

Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

Aurora Borealis
Aurora Borealis
Joined: 11 Feb 05
Posts: 19
Credit: 5657955
RAC: 0

> I joined E@H today. It

Message 3384 in response to message 3383

> I joined E@H today. It seems to have downloaded just a single wu.
>
> I think BOINC is missing several things right now, 2 I'd like to see soon are
> the old "stop_after_send.txt" mechanism whereby a participant can stop
> receiving units if desired, without detaching from the project and losing the
> current wu. Sometimes, I want to concentrate on a particular project for my
> own reasons, normally simple competitive issues.
>
> Another nice feature would be to have some manual control over the scheduling
> - it may help the situation being described here. If I can see that a wu is
> getting close to it's deadline, I could flag CPDN/S@H/P@H/LHC as "disabled" at
> the client level. I realise I can go to the websites and adjust all the
> percentages, but that is a pain if like me, you participate in all the
> projects. Having this does not solve the problem above, since suspending a
> project at the client level would simply risk running it past it's deadline
> which is of no use to anyone.
>

Some, if not all, these features are in the Alpha of Boinc CC now being developed.

Questions? Answers are in the BOINC Wiki.

Boinc V6.10.6 Alpha Test
WinXP C2D 2.1G 3GB

Allan Taylor
Allan Taylor
Joined: 4 Jan 05
Posts: 4
Credit: 145814
RAC: 0

For the features you are

For the features you are requesting, you should check out Boincview. I am using the latest release candidate for the new core clients (Version 1.0 RC5). I never run the Boinc manager, it has a long way to go to come close to what Boincview is capable of.

You can download it here.

[img]http://www.boincstats.com/stats/banner.php? cpid=19ffeb205afac117477975798c4ada3f[/img]

Bruce Allen
Bruce Allen
Moderator
Joined: 15 Oct 04
Posts: 1119
Credit: 172127663
RAC: 0

This is indeed a very clear

Message 3386 in response to message 3367

This is indeed a very clear analysis of the situation.

> So what could be done to reduce the size of the database? If i am correct, the
> database that we are talking about hear are the database that consists of
> entries of what host have downloaded what WU. Then you download a WU an entry
> is made in the database. This entry remains in the database until all other
> users who have downloaded the same WU have bean accounted for or the deadline
> for unaccounted users have bean reached. My question is, are the entries in
> the database removed a.s.a.p. or is the removal delayed a couple of days so we
> can look at some past results? If so, i am willing to trade that feature for a
> couple of hour’s extension to the deadline.

I'd appreciate feedback on this. I decided to leave completed WU in the database for 7 days, to let our participants go back and examine them. Reducing this storage time would indeed help to keep the database 'lean and mean'. Would less than 7 days be OK? What do you propose?

Cheers,
Bruce

Director, Einstein@Home

John McLeod VII
John McLeod VII
Moderator
Joined: 10 Nov 04
Posts: 547
Credit: 632255
RAC: 0

> One thing that has become

Message 3387 in response to message 3374

> One thing that has become apparent to me is that people in general, at least
> the people that post here, set their caches to high. I don't know if it is
> motivated by a paranoia about not being able to connect and get more work or
> if it is just an over sensitivity to being out of work for a few minutes while
> their machines connect and get work but I for one want the absolute minimum of
> work units sitting on my hard drives and I have never had a problem of having
> no work.
>
> I think the default of 0.1 days to connect to the network is ideal if you have
> a connection that is always on. If you're crunching 24x7 then maybe 0.5 - 1
> days. If you're dial up, then that is another matter entirely and maybe most
> people are on dial up connections.
>
> There have been a very few instances where one of the 4 projects I'm working
> on couldn't provide me work but that's why there's BOINC in the first place,
> so you can work on multiple projects so this is a non-issue. One project has
> no work, BOINC just rolls to the next in line. On my machines, all projects
> share equally, it makes it so simple, why make it complicated.
>
I hve my queue set to 0.1 days. BOINC insists on downloading work from all projects, even though that overloads the computer to such an extent that deadlines are missed. I do not think that a 0.1 deadline is too long.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.