markcmiller86 commented on issue llnl/Silo#537.
Yes. But, don’t also enable BUILD_TESTING. I am seeing a CMake issue when that is enabled. And, of course, you can’t have the python interface in a static build either….
becker33 commented on issue spack/spack-packages#4076.
Re-ran cI jobs that failed for CI-system issues, they all pass. …
grondo commented on issue flux-framework/flux-security#219.
Thanks! Setting MWP….
garlick commented on issue flux-framework/flux-core#7500.
After discussion with @trws and some reflection on #7496 , it seems like the existing python schedulers should be reusable with externally provided ResourcePool implementations. I’ll change the “rack” test scheduler proposed here to be a test RackPool class instead, and then make it possible to override the pool class from the Scheduler base class (via module option and via R.scheduling.writer URI key)….
grondo commented on issue flux-framework/flux-core#7528.
> But, given your explanation, my sample script should work, right?…
markcmiller86 commented on issue visit-dav/largedata#5.
Thanks for the report and the fix :muscle:…
imguoguo closed issue sipeed/picoclaw#2427.
[BUG] WebUI cannot show both of content and tool_call at the same time now.
By default, the WebUI will show the tool_call only. After refreshing the page, it will show the content only….View Comment
vsoch commented on issue singularityhub/guts#10.
We would need a SingularityContainer class akin to the DockerContainer class that exposes the same function signatures, and I think that would do it. Knowing Singularity images, the issue we might run into is with respect to speed to do the same operations, but it is worth a test! If you are interested in taking a shot, the main entrypoint creates a ManifestGenerator that is going to call extract on that class. That happens here:…
v-dobrev commented on issue mfem/mfem#5251.
This PR is now under review (see the table in the PR description). To help with the review process, please do not force push to the branch….
JustinPrivitera commented on issue llnl/conduit#1358.
Updated strawman from my talk: …
garlick commented on issue flux-framework/rfc#452.
Great point about the single core jobs. I am flip flopping on the template idea today and going around in circles I’m afraid….
sam-maloney commented on issue flux-framework/rfc#452.
I’m going to start working through the jobspec examples from RFC 14 to see what comes up (giving just the resources key, and essentially arbitrary assignment of ids):…
trws commented on issue flux-framework/flux-sched#1424.
I’m digging into this now, and I’m pretty sure I have a handle on what’s going on. Fluxion sets the expected start time as an annotation, and sends that along to core. Core interprets time of now or earlier as “now.” Both of these are fine independently, but sched tries to avoid sending too many annotations because of some old issues with memory ballooning due to too many annotations….
garlick commented on issue flux-framework/flux-docs#317.
Great improvement!…
chu11 commented on issue flux-framework/flux-core#7528.
> Yes, that’s pretty much what I want. But, doesn’t flux batch require flux start to be called prior to make the resources available? How is it seeing the resources?…
cmoussa1 commented on issue flux-framework/flux-accounting#846.
Thanks @jameshcorbett!…
hppritcha commented on issue open-mpi/ompi#13713.
@devreal could you re-review this PR?…
jandrej commented on issue mfem/mfem#5296.
What part of the dolfinx assemble process are you timing? Can you please post the source including the regions and timings you get?…
jameshcorbett commented on issue flux-framework/flux-docs#315.
LGTM as well, so I hit the button….
wihobbs commented on issue flux-framework/flux-docs#310.
Thanks @garlick! I also added a link to the tracking issue for multi-cluster accounting, instead of just “needs design.” I’ll set MWP….
wihobbs commented on issue flux-framework/flux-core#7252.
Interestingly, flux-sched’s CI appears to be catching a legitimate error not caught by core:
…
cbritopacheco commented on issue cbritopacheco/rodin#226.
@copilot Revert back to commit 9871634…
jeremylt commented on issue CEED/libCEED#1950.
For sure. If we see a performance difference, then I think the way to go for ChipStar would be to make chipstar backends /gpu/hip/chipstar/shared and /gpu/hip/chipstar/gen that delegate back to the current shared/gen code and that code would check the resource string for the root /gpu/hip/chipstar to determine if it needs to do the padding elements…
hppritcha commented on issue spack/spack-packages#4163.
please address others’ comments and I’ll review….
hppritcha open issue mpi-forum/mpi-issues#1107.
MPI_Type_commit - should it be const?
# Problem…View Comment
balos1 commented on issue llnl/sundials#898.
> This is for version 7.6.0!…
btalamini open issue llnl/smith#1581.
Axom codevelop build problem on mac
Since the switch to c++20, building Smith on my Mac fails due to a non-compliant c++20 problem in Axom, in particular with sol….View Comment
artv3 commented on issue llnl/benchmarks#56.
Inconsistency between “non-RAJA variants” and “Base variants” as well…
mplegendre commented on issue llnl/Spindle#172.
Looks good. Thanks….
jameshcorbett commented on issue flux-framework/flux-accounting#845.
Nice!…
benzwick closed issue benzwick/guix-mvox#3.
Guix Systole CI: channel fails to compile against current Guix
The Guix Systole and Guix Slicer CI workflows fail because the guix-systole channel modules don’t compile against current Guix during guix pull….View Comment
benzwick commented on issue SystoleOS/guix-systole#102.
Confirming this is resolved
tgamblin commented on issue spack/spack-packages#3962.
> @tgamblin is suggesting we move all Python-related packages to a separate py namespace such that py.ruff and ruff both work. Maybe we should hold off on the rename for now? …
milroy commented on issue flux-framework/flux-sched#1443.
I noticed similar behavior when implementing PR #1316. Here’s the commit where the interner storage is opened and closed (fixed the UB I was observing in the unit tests): https://github.com/flux-framework/flux-sched/pull/1316/changes/1c2a56c24262312eacf62dd8041ba1fe6e4b7f51#diff-d55e8cb6b34796f9929f4d3cd51b201e5bc6dea64e17ab617ded2c4af43b81d8R769-R770 …
tgamblin commented on issue spack/spack-packages#3200.
Closing this as we do not plan to implement it. We may make stage names configurable as in https://github.com/spack/spack-packages/pull/3200#issuecomment-3821996900, which would at least give affected people an easy out….
imguoguo commented on issue sipeed/picoclaw#2236.
gardner48 commented on issue llnl/sundials#900.
The last commit updates the level 2, 4, and 5 logging outputs to match the output on the Jenkins test machine. The Jenkins PR tests use logging level 3 and the release tests use logging level 2 (the default level), so outputs for levels 2, 4, and 5 were not updated in previous PRs and a difference in the level 2 output was causing one test to fail. …
cbritopacheco commented on issue cbritopacheco/rodin#217.
@copilot+claude-opus-4.6 Please proceed auditing, fixing, adding, implementing, correcting the batches you listed, updating the checklist….
JustinPrivitera commented on issue visit-dav/visit#20891.
I see, but this is only needed for containers? Or will we need this on LC?…
cyrush commented on issue visit-dav/visit#20903.
window is blank (as expected) for both standard and ospray mode, anari is the only mode we have an issue…
markcmiller86 commented on issue visit-dav/visit#20897.
@cyrush I think this is ready now….
gardner48 commented on issue sundials-codes/answers#72.
Re-targeting this PR to main for llnl/sundials#897…
v-dobrev commented on issue mfem/mfem#5246.
Re-merged in next for testing……
jameshcorbett open issue flux-framework/flux-sched#1443.
Updating Fluxion to v0.50.0 causes interner to abort
On Hetchy, I updated the flux-sched RPM and then ran the following:…View Comment
grondo commented on issue flux-framework/flux-sched#1444.
@mergifyio refresh…
trws commented on issue flux-framework/flux-sched#1443.
This means somewhere we have a top-level entrypoint that isn’t catching exceptions to avoid termination. That needs to be fixed….
grondo commented on issue flux-framework/flux-core#7501.
Thanks @trws and @wihobbs for your comments. After further inspection, there’s some outstanding ambiguities in how CLI plugin options and dest work that this implementation makes worse:
…
wihobbs commented on issue flux-framework/flux-core#7501.
Added to the scope of #6980…
trws commented on issue flux-framework/flux-core#7501.
Sounds good to me, @wihobbs would you create something?…
garlick commented on issue flux-framework/flux-core#7513.
Fixed a typo in the commit message and setting MWP, thanks….
jameshcorbett commented on issue flux-framework/flux-coral2#466.
Oh ok that’s very helpful actually. What’s happening is that Flux is trying to watch the storages resource in K8s for updates about rabbit status. It asks k8s to only see recent changes, where “recent” is defined by passing in an identifier string grabbed from one of the storages resources. However, kubernetes is rejecting the string as being too old–I guess there is a storage resource on elcap somewhere that hasn’t been updated in a long time. As a fallback, with Kubernetes rejecting the watch request, Flux asks to get the current status of every single rabbit on the system. That takes a long time to process. Then Flux tries to watch changes to the storages resources again with a new identifier string (and hopefully a more recent one that kubernetes will accept). But no–for whatever reason, the identifier string is rejected once more, and the cycle begins again. In effect, Flux asks to get the current status of every single rabbit on the system every five seconds. This generates a ton of traffic and log messages….
cmoussa1 commented on issue flux-framework/flux-coral2#467.
What about raising a job exception on those rabbit jobs with this descriptive error message? I am not sure if that is any better from the user or admin’s perspective if the job has an exception raised on it, but would that prevent them from having to a) kill the jobs manually, or b) restart a Flux instance entirely? And instead you could just reload the plugin after all of the rabbit jobs have finished cleaning up?
cmoussa1 commented on issue flux-framework/flux-accounting#840.
Thanks for giving this a look @grondo! I just force-pushed up a couple fixes based on your suggestions above. I also added a new set of tests to check that with two queues a job held in queue A due to a max SCHED limit is not released when a job in B transitions to RUN state. I went ahead and appended that set of tests to the sharness test file….
jeremylt commented on issue CEED/libCEED#1830.
I would recommend this order, personally…
vladotomov commented on issue CEED/Laghos#209.
@amroakmal this looks good, thank you….
cyrush closed issue Alpine-DAV/ascent#1686.
Threshold filter on point meshes crashes due to CleanGrid merging coincident points
Description:…View Comment
emily-howell commented on issue Alpine-DAV/ascent#1663.
@cyrush I went through and addressed all of your feedback. on all three branches. I tried to keep things still relatively with the original changes to keep things less noisy. I did update the documentation in #1682 to reflect the changes that I made so that might be a good place to start reviewing from….
jeremylt commented on issue openjournals/joss-reviews#10237.
Unfortunately, I am not available until August…
artv3 closed issue llnl/RAJAPerf#578.
FEM Kernel Update
- [x] Add an atomic variant for the mass PA kernel …View Comment
adrienbernede commented on issue llnl/RAJAPerf#529.
Replaced by #638 …
trws commented on issue llnl/RAJA#2009.
As a stylistic note, we do this in OpenMP with single if the other threads should wait, or with masked or the deprecated master if they shouldn’t, while once is used by C++, C, and posix to mean “run exactly one time, no matter how many threads encounter this, and block all encountering threads until that one run is done”.
…
artv3 commented on issue llnl/RAJA#1917.
@llnl/raja-core this may need another review. …
adrienbernede commented on issue llnl/RAJA#1999.
@rhornung67 this should really be merged: it will update the toolchains and reduce spack concretization times….
milroy commented on issue flux-framework/flux-sched#1432.
@jameshcorbett the final issue to address is the question of the scope of the change of token to sequence_number (i.e., everywhere in the code, just the readers, or postpone the renaming). Once we settle that I’ll approve the PR….
vsoch commented on issue flux-framework/flux-docs#313.
https://github.com/jbms/sphinx-immaterial/issues/485…
chu11 commented on issue flux-framework/flux-core#7493.
thanks, will set MWP…
benzwick closed issue benzwick/guix-mvox#2.
Error when building mvox using guix
Build failed after executing guix install mvox…View Comment
alecbcs commented on issue spack/spack-packages#4060.
I originally stayed away from using urls because a package cannot have both url and urls properties and there are some r-* packages that set a url, however since we gate populating the urls on the cran class attribute I think a package setting both a cran identifier and a custom url is reasonably expected to fail….
tgamblin commented on issue spack/spack#52167.
If this is going to be essentially the same as spack compiler find, should we document that clearly? Is spack compiler find a convenient alias for this, or is this an alias for spack compiler find? I think probably spack compiler find should be documented to say it’s an alias for this….
jedbrown commented on issue openjournals/joss-reviews#9333.
@editorialbot generate pdf…
hppritcha commented on issue open-mpi/ompi#13801.
Thanks for the heads up. I don’t think any of our test systems have such a new version of pbspro. In fact, only system I have access to that runs pbs (aurora) runs a very old version of that software. looks like …
tzanio commented on issue mfem/mfem#5291.
ping: @stefanozampini …
JustinPrivitera closed issue llnl/conduit#1132.
is this multi-buffer material set example correct?
``` …View Comment
BradWhitlock commented on issue llnl/axom#1836.
Closing this PR as it was only for reviews….
milroy commented on issue flux-framework/flux-core#7439.
Thanks for the feedback! Setting MWP….
vsoch commented on issue vsoch/pipelib#13.
@matthiask thank you for this PR! I have a few deadlines into April but I’ll have time to look after that….
camierjs open issue mfem/mfem#5247.
Test failure for `ex1p_ceed_np=4`
Thank you for developing and maintaining this project….View Comment
benzwick closed issue benzwick/StreamVorti#45.
Parallel DCPSE produces partition-dependent results for 2D block partitions (np=4)
## Problem…View Comment
garlick commented on issue flux-framework/flux-core#7494.
Reproduced:…
vsoch open issue converged-computing/mcp-server#10.
Features to add for more production robustness
When we have this deployed in a production setting, I want:…View Comment
markcmiller86 commented on issue visit-dav/visit#20892.
Ok, I built noise.C on current develop against Silo 4.12 and Silo 4.10.3 and then ran both versions of noise creating PDB files….
cyrush commented on issue visit-dav/visit#20892.
I think maybe its an issue with looping over the dims…
imguoguo commented on issue sipeed/picoclaw_docs#18.
LGTM…
v-dobrev commented on issue mfem/mfem#4917.
Merged in next for testing……
alecbcs commented on issue spack/spack#52144.
> I think putting new commands we just happen to want in CI under spack ci is not really the path forward. This is going to operate on mirrors, considering the state of a repo. Neither of those things has anything to do with “ci” which is really just “build farm scheduling”. …
cjvogl commented on issue mfem/mfem#5270.
I just ran the code and everything is looking good! All that is left is to address the last syntax concerns noted above….
artv3 commented on issue mfem/mfem#5124.
Any comparisons with MFEM’s AMGX backend? https://github.com/mfem/mfem/blob/master/examples/amgx/ex1p.cpp Super cool! …
mplegendre commented on issue llnl/Spindle#148.
I went over the /bin/.. case in detail, and it looks like we handle it okay. There’s definitely some oddities with spindle putting “..” into an internal variable named ‘file’. But we ultimately always supported spindle operations on directories (even if that wasn’t common), and “..” just gets treated like other directories and handled correctly. …
markcmiller86 commented on issue llnl/Silo#527.
@biagas this is how I’d like to proceed with Silo. That said, if you feel better about your original patches for bv_silo.sh because you have tested those changes in multiple places, then I am ok with keeping your bv_silo.sh logic as is….
garlick commented on issue flux-framework/rfc#505.
Eh this was prematurely posted - sorry. Closing for now….
jameshcorbett commented on issue flux-framework/flux-sched#1432.
Aside from 229e25d, there are no code changes from @milroy ‘s last review. However, I did add some new tests in the last commit. The tests fail because the agfilter output is incorrect for cores: it considers fewer cores to be allocated than are in fact allocated. This discrepancy appears to have something to do with the match format and the way it skips vertices, because if the test is re-run with rv1 match format instead of rv1_shorthand, or if the job is modified to only request nodes non-exclusively, the agfilter checks pass. …
sam-maloney commented on issue flux-framework/flux-sched#1384.
I’m going to jump on this thread to add that I’ve also been getting some timeouts recently on one of my builds. In particular, t1028 was consistently timing out, and t2317 did once. As soon as I removed the timeout threshold t1028 passed consistently….
jameshcorbett open issue flux-framework/flux-core#7494.
idset: `idset_set (idset, -586)` caused `SIGABRT`
As part of https://github.com/flux-framework/flux-sched/pull/1438 I tried passing some negative integers to idset_set and found that passing -586 (which I picked randomly) caused my process to be killed with SIGABRT. I noticed the following assertion message in my test output:…View Comment
grondo commented on issue flux-framework/flux-core#7495.
I’d just add a note in the documentation. Something like…
wihobbs commented on issue flux-framework/flux-core#7495.
I guess an edge case might be “what if the hostfile for the taskmap specifies fewer hosts than the -N request?” which is currently allowed….
jameshcorbett commented on issue flux-framework/flux-coral2#463.
Similar to your PR this also needs a manual merge @cmoussa1 …
cmoussa1 commented on issue flux-framework/flux-accounting#836.
Thanks @jameshcorbett! Setting MWP here…
JustinPrivitera closed issue visit-dav/visit#20846.
numpy issue on LC toss4
Even though default compiler on lc systems is gcc 10, numpy will look for ccache, which in a default env ends up using gcc 8.x. …View Comment
becker33 closed issue spack/spack#51971.
Mirroring packages that throw an error breaks mirroring
### Steps to reproduce…View Comment
hppritcha commented on issue open-mpi/ompi#13790.
okay this is curious. The “problem” seems to be in the ompi error checking functions in cases where the send/recv count value is 0. This is not really a fortran interface problem….