Excessive Disk Usage

rroonnaalldd
rroonnaalldd
Joined: 12 Dec 05
Posts: 116
Credit: 537221
RAC: 0

Thanks for opening a Trac

Message 97280 in response to message 97279

Thanks for opening a Trac ticket in that direction. This behaviour is known to me since 5.8.16, resetting Einstein ends with all except the SxRy-files are gone and editing the client_state.xml is not a solution for everyone.

John
John
Joined: 26 Dec 05
Posts: 2
Credit: 24060617
RAC: 0

RE: If you can't afford

Message 97281 in response to message 97278

Quote:
If you can't afford 250MB of disk space, E@H is really not the project for you.


and

Quote:
In any case, with disks being so large and disk space being so cheap these days, do you really need to be concerned about the odd 250MB?


I too am seeing excessive disk usage from E@H. In my case, I allow 1.5 GB for BOINC projects, and E@H is taking up 1.3 GB of that. My concern isn't that it is taking up that much, it is that it is not leaving any room for the other projects I am working on. They can't get more than one task downloaded because you won't free up disk space. I can't even stop getting new tasks to let the others have their chance, because it doesn't free up any space.

Since you are a forum moderator, I am replying as though you are part of the project. If you are not, feel free to forward my comments to somebody who is.

As for being concerned about the odd 250MB (or 1.3 GB in my place), you seem to have forgotten a key fact: I AM DOING YOU THE FAVOR. I am paying for the electricity (for the 3 machines I am running this on at home, that is not trivial), the hardware, and half of the the bandwidth. For my assistance with your project, I expect you to play nicely with others (i.e. don't hog whatever resources I choose to allocate), and be gracious about my gift to you (i.e. don't ask why I'm concerned about the odd 250MB like there's something wrong with that. It's MY 250MB to do with as I please).

I have already quit one project (Milkyway@home) because they set their task due date so soon that they were always running at high priority. Yes, I know the task scheduler takes such abuse into account. It's the principle and the attitude that they "deserve" more priority than other projects. Same principle here, E@H is taking disk space at the expense of other projects.

So if your attitude is that you MUST use that much disk space and CAN'T let other projects have some for their work (note that on my other computer you are getting by with a "mere" 582 MB), I can happily take my i7-965 with GTX260 and other computers to another project where they appreciate what I am willing to share and play nicely with others.

But I'd much rather continue helping you.

John

Bikeman (Heinz-Bernd Eggenstein)
Bikeman (Heinz-...
Moderator
Joined: 28 Aug 06
Posts: 3522
Credit: 716975188
RAC: 1001220

RE: So if your attitude is

Message 97282 in response to message 97281

Quote:
So if your attitude is that you MUST use that much disk space and CAN'T let other projects have some for their work [...]

YOU are in full control of defining how much disk-space should be consumed by BOINC as a whole, in the BOINC preferences.

As far as I know, the "Resource Share" that you define for a project is not just used to distribute CPU time in a fair way among the projects, it should also play a role in distributing the disk space and prevent some projects form "stealing" space from other projects.

The individual project is unaware of the space consumed by the other projects. It just gets the info "hey, you can you so and so much disk space" by the BOINC client and will use space up to this limit. There is no way to "play nice" on other projects for the individual projects, because BOINC, not the projects, are supposed to be the broker of resources (CPU and disk space).

CU
HBE

Henk Haneveld
Henk Haneveld
Joined: 5 Feb 07
Posts: 18
Credit: 14273835
RAC: 1596

Observation: S5GCE uses 2

Observation:

S5GCE uses 2 sets of datafiles, S5R4 and S5R7. When there are no more results that need a certain set of files the S5R4 files are deleted but the S5R7 files stay on disk leading to a growing number of S5R7 files not in use.

Question:

Are the S5R7 files left behind in preparation for the search that comes after S5GCE or is there a problem with the deletion command?

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5872
Credit: 117361837903
RAC: 35712827

RE: I too am seeing

Message 97284 in response to message 97281

Quote:
I too am seeing excessive disk usage from E@H. In my case, I allow 1.5 GB for BOINC projects, and E@H is taking up 1.3 GB of that. My concern isn't that it is taking up that much, it is that it is not leaving any room for the other projects I am working on.


1.3GB just for E@H seems unusually large but you need to consider the reasons why that much space is being used. At the moment, there are three separate subprojects going on, each of which will be using a significant amount of disk space that can't really be avoided (or even shared for that matter). Those projects are S5R6, S5GCE and ABP2. Each subproject needs separate executables, separate data files and separate tasks.

The first two subprojects use locality scheduling that requires a significant number of large data files to exist on each host before any actual task can be downloaded and started. You cannot do a single task without incurring a significant disk space penalty. If you complete a task and expect to do any more in the future, you can't recover disk space in any meaningful way by deleting files since a request for a single new task will immediately occupy all that disk space once again with the added penalty of new data downloads that could have been avoided.

The ABP2 subproject doesn't use locality scheduling so, with it, your main disk usage comes from the executables and the tasks themselves. Each task consists of four 2MB data files which will be deleted when the result is safely returned. So if you don't keep a large cache of work, you wont use extra disk space for ABP2 tasks.

Quote:
They can't get more than one task downloaded because you won't free up disk space. I can't even stop getting new tasks to let the others have their chance, because it doesn't free up any space.


There is nothing the project can effectively do to make the disk space penalty much smaller. Unfortunately there has to be a transition from the previous S5R6 project to the new S5GCE replacement subproject and during this transition, for a variety of complicated reasons, the disk space used may very well be quite a bit more than double what you would otherwise expect it to be if you were in the middle of a single long run. So all I can do is repeat that if the project is consuming more disk space than you are prepared to allow, perhaps this project is not suited to your requirements. That is not meant as a put down or as a means of driving anybody away. It's just an unfortunate fact of life.

Quote:
Since you are a forum moderator, I am replying as though you are part of the project. If you are not, feel free to forward my comments to somebody who is.


A forum moderator (without any other added 'qualification') is an ordinary volunteer, just like yourself, but with the added responsibility of trying to answer questions and handle matters of decorum when necessary. We also have the ability to communicate with project staff if needed. Whilst we do tend to get advance information when something major is about to change and we can refer specific matters to a 'higher authority', it is expected that we should be resourceful enough to handle most queries and events without 'bothering' the Devs, who are perpetually busy anyway. In this specific instance, a complaint about excessive disk usage is something we should be capable of resolving ourselves.

Quote:
As for being concerned about the odd 250MB (or 1.3 GB in my place), you seem to have forgotten a key fact: I AM DOING YOU THE FAVOR.


I apologise unreservedly if a poor choice of words has given anyone the impression that their contribution is not valued and fully appreciated by the project staff. But let's get one thing perfectly clear. The project doesn't 'owe' you, or me, or anybody else for that matter and we shouldn't expect any 'reward' (other than personal satisfaction) for 'doing them a favour'. This notion that we are doing them a favour and that gives us rights beyond the right to choose to participate or not is a notion that is quite incorrect, in my opinion.

Why do people VOLUNTEER the use of their hardware and agree to pay for the running costs? I would hope that the answer is something along the lines that supporting the discovery of new knowledge through scientific endeavour is something worthy of your donation. What rights do you have if you are not happy with the way the project is using your volunteered resources? Well you certainly have the right not to participate. You don't have a right to require the project to change how it operates.

Quote:
I am paying for the electricity ....
....


Every other volunteer is doing exactly the same.

Quote:
So if your attitude is that you MUST use that much disk space and CAN'T let other projects have some for their work (note that on my other computer you are getting by with a "mere" 582 MB), I can happily take my i7-965 with GTX260 and other computers to another project where they appreciate what I am willing to share and play nicely with others.


Bikeman has given you an answer to your points. I would like to give you my own thoughts as well.

There is a substantial difference between the BOINC framework and all the project apps that use BOINC to share what you are prepared to volunteer in the way of your resources. It is an oxymoron to suggest that one project can decide 'not to play nice' with another. Individual projects are totally unaware of any other projects and what their particular requirements might be. Apart from yourself, the only 'judge, jury and executioner' is BOINC itself so if there are problems in supporting competing projects which are not able to share resources in an equitable manner, there are actually only two things to blame and the projects themselves are not one of them.

The obvious first point of blame is BOINC. Clashes between projects that BOINC doesn't handle well can only be resolved by the BOINC Devs implementing new features and controls into BOINC. This must be quite an arduous and thankless task as the number of BOINC projects and their competing requirements continues to spiral upwards. Contributing on the BOINC alpha mailing list would be a way to have your personal opinions about 'projects that aren't playing nice' shared with people who could actually do something about it.

The other thing to blame is actually the volunteer. I trust you will take these comments as my personal and honest thoughts on the subject and not try to construe them as any sort of attack on anyone, least of all yourself. It really is the responsibility of the volunteer to think through how much they are prepared to donate and to choose their projects carefully. That really does involve a lot of reading and a lot of question asking. It doesn't involve blaming the project for not showing the proper respect.

Take Milkyway as an example seeing as you mention it. It limits your cache of work to 6 tasks per cpu at any one time. It used to have a deadline of three days. The deadline is now 8 days. It supports both CPUs and GPUs where a given task might take several hours on even a modern CPU but only a minute or two on a modern ATI GPU. There were specific and credible reasons given by the project staff for both the 6 tasks and 3 days limits. Many people have complained long and loud about these limits and I clearly remember the complaint about MW hogging the resources by always being in high priority mode. In most cases, this complaint was largely due to the participant being unwilling to compromise on cache setting and/or unable to appreciate that setting a large cache for a project with a 3 day deadline would confuse the hell out of BOINC.

I've been crunching MW tasks for over a year now, initially on CPUs that were too slow for E@H but more recently on ATI GPUs that E@H cannot use. Since ATI support was added to BOINC, I've been able to configure BOINC so that both my projects can coexist quite happily and I never see high priority mode or examples of one project hogging resources. I've always been able to find a way to keep things on an even keel by using the various preferences that BOINC provides. This also includes 'micromanaging' in a relatively minor way to allow MW to share with Collatz as a backup project for the ATI GPUs whilst still allowing a decent cache of work for E@H running on the CPUs. I expect that the micromanaging will disappear when the BOINC Devs get a better handle on how to properly manage GPU support. I don't see any logic in blaming any project for BOINC's limitations.

Quote:
But I'd much rather continue helping you.


I'm sure the project values your long term support from way back in 2005 and I, for one, hope you will continue that support.

Since you linked to your i7, I had a browse through your tasks list and I can see why you would have a high disk usage. I don't think you have any current S5R6 resends left to crunch now but you did have several that are still visible that were crunched over the last week or so. Each of those resends were for a different frequency range so you would have a complete data subset for each single task. Unfortunately, this is what can happen when we are cleaning up the dregs of a long running project that is coming to a close. The scheduler will be advising your client to delete the large number of data files for the different frequency bands when it is sure that they will be no longer required. This is what has to happen with locality scheduling. If you desperately needed to reclaim the disk space immediately, you can take manual action by editing your state file to clean out the references to the old data files and then deleting the old data files themselves. You really need to understand what you are doing for attempting something like this. It's not a trivial exercise and I certainly wouldn't recommend it in normal circumstances.

The safer course of action is to understand the reasons for this temporary excessive disk usage and to make a decision that perhaps you can cut the project a little slack and increase your disk usage limits sufficiently to tide you over the temporary situation. Also, unfortunately it's not going to be quite as temporary as it normally would be. If you check the server status link on the front page, you will notice that the GCE run will only last for another month. At this point there will be another of these pesky transitions with the same set of problems all over again.

Cheers,
Gary.

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5872
Credit: 117361837903
RAC: 35712827

RE: Are the S5R7 files left

Message 97285 in response to message 97283

Quote:
Are the S5R7 files left behind in preparation for the search that comes after S5GCE or is there a problem with the deletion command?


Thanks very much for bringing this to our attention.

I don't know the answer but I suspect it may be a bug as normally all data files get deleted as the tasks for a specific frequency run out. There was a case in the distant past where the h1_* files were being deleted and the l1_* files for the same frequency were being missed. That turned out to be a bug.

I will pass your observation on to the Devs.

Cheers,
Gary.

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 167

RE: There is a substantial

Message 97286 in response to message 97284

Quote:
There is a substantial difference between the BOINC framework and all the project apps that use BOINC to share what you are prepared to volunteer in the way of your resources. It is an oxymoron to suggest that one project can decide 'not to play nice' with another. Individual projects are totally unaware of any other projects and what their particular requirements might be.


While true, I think erinm thinks that since the preferences in the global preferences/computing preferences are the same on all his projects, that all his projects must know about the other projects.

They don't. These preferences are for the managing program, BOINC.
And then if you look at the Disk and memory settings, they are for all of BOINC, for all projects running under BOINC.

These settings work together with the Network usage settings of "Computer is connected to the Internet about every X days" and "Maintain enough work for an additional Y days". Especially with the latter one. If set high, this will be on all the projects you are attached to on the same venue (and if not overridden by local preferences).

I'm wondering why if you have an i7-965 with Hyper Threading on (so 8 CPUs), you 'constrain' BOINC by giving it only 1.5GB total disk space to work with. I can't imagine that you have spent high money on such a CPU, but cheated on everything else and only installed a 20GB drive in the machine. There must be something bigger in there, else you can't even install Vista Ultimate x64 in a reasonable manner.

So what are your settings for the following?
Use at most A GB disk space
Leave at least B GB disk space free
Use at most C% of total disk space

Computer is connected to the Internet about every D days
Maintain enough work for an additional E days

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5872
Credit: 117361837903
RAC: 35712827

RE: Observation: S5GCE

Message 97287 in response to message 97283

Quote:

Observation:

S5GCE uses 2 sets of datafiles, S5R4 and S5R7. When there are no more results that need a certain set of files the S5R4 files are deleted but the S5R7 files stay on disk leading to a growing number of S5R7 files not in use.

Question:

Are the S5R7 files left behind in preparation for the search that comes after S5GCE or is there a problem with the deletion command?


I've passed on your observation as promised, and I've now received some replies. The S5R7 files are meant to be deleted but there is an undue delay to the deletion process that accounts for what you have reported. The files would eventually have been deleted but it might have taken quite a while so it is being treated as a bug and a fix (which will eliminate the delay) is being worked on.

The reason for the problem in the first place is a bit complicated so I'm paraphrasing somewhat and reading between the lines a bit, too. Essentially, the S5R7 files are not being treated exactly the same as the S5R4 files. However, your BOINC client is aware that these files are finished with and will be able to delete them (one at a time as required), particularly if your client starts to see a problem with exceeding disk space preferences. When the BOINC client sees that it needs to make extra disk space available, it's not smart enough in the way it selects files for deletion so it may need a few attempts before it brings the disk usage back within specification. The S5R7 files will be considered for deletion during each of these attempts and do get selected most of the time. So eventually, they will be cleaned up.

When the fix has been decided on, I imagine it will involve deleting the S5R7 files at the same time as S5R4 files, so eventually this will sort itself out without any user intervention being needed. If you wanted to experiment and didn't mind BOINC telling you from time to time that it couldn't get more work because of lack of disk space, you could try carefully reducing your disk space preferences until you got that message. You could then observe if your BOINC client was able to delete enough of your excess S5R7 files to meet the disk space restrictions you have now imposed. I hasten to add that I'm not suggesting that anyone should do this, particularly if you are rather averse to creating self inflicted and unnecessary problems.

Until the fix is in, it might actually be prudent to allocate BOINC a bit more space (rather than less) so that you don't run into limits triggered by this problem of S5R7 files not being deleted in a timely manner. The majority of the hosts I run (all the ones that just crunch) only have 20GB disks (or less) and most of those have both Windows and Linux installed. The Win allocation is 8GB and the balance is Linux. I trust BOINC enough not to waste disk space so I tell it to leave only 0.01GB free and to use at most 95% of total disk space. I don't get problems with disk space limits so far that I've noticed.

Cheers,
Gary.

Henk Haneveld
Henk Haneveld
Joined: 5 Feb 07
Posts: 18
Credit: 14273835
RAC: 1596

RE: RE: Observation: S5G

Message 97288 in response to message 97287

Quote:
Quote:

Observation:

S5GCE uses 2 sets of datafiles, S5R4 and S5R7. When there are no more results that need a certain set of files the S5R4 files are deleted but the S5R7 files stay on disk leading to a growing number of S5R7 files not in use.

Question:

Are the S5R7 files left behind in preparation for the search that comes after S5GCE or is there a problem with the deletion command?


I've passed on your observation as promised, and I've now received some replies. The S5R7 files are meant to be deleted but there is an undue delay to the deletion process that accounts for what you have reported. The files would eventually have been deleted but it might have taken quite a while so it is being treated as a bug and a fix (which will eliminate the delay) is being worked on.

The reason for the problem in the first place is a bit complicated so I'm paraphrasing somewhat and reading between the lines a bit, too. Essentially, the S5R7 files are not being treated exactly the same as the S5R4 files. However, your BOINC client is aware that these files are finished with and will be able to delete them (one at a time as required), particularly if your client starts to see a problem with exceeding disk space preferences. When the BOINC client sees that it needs to make extra disk space available, it's not smart enough in the way it selects files for deletion so it may need a few attempts before it brings the disk usage back within specification. The S5R7 files will be considered for deletion during each of these attempts and do get selected most of the time. So eventually, they will be cleaned up.

When the fix has been decided on, I imagine it will involve deleting the S5R7 files at the same time as S5R4 files, so eventually this will sort itself out without any user intervention being needed. If you wanted to experiment and didn't mind BOINC telling you from time to time that it couldn't get more work because of lack of disk space, you could try carefully reducing your disk space preferences until you got that message. You could then observe if your BOINC client was able to delete enough of your excess S5R7 files to meet the disk space restrictions you have now imposed. I hasten to add that I'm not suggesting that anyone should do this, particularly if you are rather averse to creating self inflicted and unnecessary problems.

Until the fix is in, it might actually be prudent to allocate BOINC a bit more space (rather than less) so that you don't run into limits triggered by this problem of S5R7 files not being deleted in a timely manner. The majority of the hosts I run (all the ones that just crunch) only have 20GB disks (or less) and most of those have both Windows and Linux installed. The Win allocation is 8GB and the balance is Linux. I trust BOINC enough not to waste disk space so I tell it to leave only 0.01GB free and to use at most 95% of total disk space. I don't get problems with disk space limits so far that I've noticed.

Thanks for looking in to this. I have enough free disk space so I will leave things as they are until the devs have come up with a permanent solution.

rroonnaalldd
rroonnaalldd
Joined: 12 Dec 05
Posts: 116
Credit: 537221
RAC: 0

Jord's Trac ticket was

Jord's Trac ticket was closed. :D

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.