Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

3.8.2.8 #144

Merged
merged 10 commits into from
Sep 24, 2024
Merged

3.8.2.8 #144

merged 10 commits into from
Sep 24, 2024

Conversation

PalNilsson
Copy link
Collaborator

  • Alternative stage-out
    • Additional stage-out attempt for failed transfers (data and log files) to different storage if configured in astorages
    • Being discussed in JIRA ticket ATLASPANDA-994 “Failover stage-out to write_lan/1 RSE”
    • Pull request #142
  • Improved IPv6 address extraction
    • Problem with pattern recognition seen with Alma9 at Wuppertal
    • Reported by T. Harenberg
  • Replaced remaining ps command usage with a call to the more efficient psutil python module
    • A problem during high CPU load was seen on an ARM resource (with 256 cores) due to too many concurrent ps processes (KIT)
    • Reported by M. Schnepf
  • Now possible to set real-time logging server via PQ.catchall
    • Requested by I. Vukotic
  • Bug fixes
    • Fixed typo in infosys initialization (harmless)
      • Pull request #143
    • Improved exception handling during randomization of panda server address
      • Added socket.gaierror, “Name or service not known”
      • Could otherwise lead to lost heartbeat, although problem only seen in jobs that had already failed with sending job updates to server

Contributions from A. Anisenkov, P. Nilsson

@PalNilsson PalNilsson merged commit af45e37 into PanDAWMS:next Sep 24, 2024
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant