Skip to content

Latest commit

 

History

History
173 lines (153 loc) · 9.09 KB

CHANGELOG.md

File metadata and controls

173 lines (153 loc) · 9.09 KB

Change Log

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[2.7.0] - 2021-11-18

Added

  • SOCA installer is managed by CDK (https://aws.amazon.com/cdk/)
  • Enabled full WSGI debug mode for SOCA Web UI
  • Added support for WeightedCapacity enabling add_nodes.py to launch capacity based on vCPUs or cores
  • CDK: Added support for Active Directory via AWS Directory Service
  • CDK: Users can now re-use their existing AWS resources (VPC, subnets, security groups, FSxL, EFS, Directory Services ...) when installing SOCA
  • CDK: Users can extend the base installer with their own code (see cdk_construct_user_customization)
  • CDK: /apps & /data partition can now be configured to use EFS or FSxL as storage provider
  • CDK: Users can now use your own CMK (Customer Managed Key) to encrypt your EFS, FSxL, EBS or SecretsManager
  • CDK: Users can configure the number of NAT gateways to be deployed when installing a new cluster
  • CDK: Users can customize your OpenSearch (formerly Elasticsearch) domain (number of nodes, type of instance)
  • CDK: Users can configure the backup retention time (default to 7 days)
  • CDK: Users can now deploy SOCA in private subnets only
  • CDK: Added support for VPC endpoints creation
  • Users can now specify up to 4 additional security groups for compute nodes assigned to their simulations
  • Users can now specific a custom IAM instance profile for compute nodes assigned to their simulations
  • Deprecated ldap_manager.py in favor of the native REST API
  • Added a custom path for Windows DCV logs
  • Name of the SOCA cluster is now accessible on the Web interface
  • DCV session management is now available via REST API
  • Customer EC2 AMI management is now available via REST API
  • Added job-shared queue enabling multiple jobs to run on the same EC2 instance for jobs with similar requirements
  • Desktops sessions are now tracked on OpenSearch (formerly Elasticsearch) via "soca_desktops" index

Changed

  • Upgraded DCV to 2021.2
  • Upgraded EFA to 1.13.0
  • Upgraded OpenMPI to 4.1.1
  • Auto-Terminate stopped DCV instances now delete the associated cloudformation stack
  • Fixed #55 (bug and bug fix: automatic hibernation (Linux desktops))
  • Prevent system accounts (ec2-user/centos) to submit jobs
  • OpenMPI is now installed under /apps/openmpi
  • Changed default OpenSearch (formerly Elasticsearch) indexes to "soca_jobs" and "soca_nodes" (previously "jobs" and "pbsnodes")

[2.6.1] - 2021-03-22

Added

  • Added Name tag to EIPNat in Network.template
  • Added support for Milan and Cape Town
  • EBS volumes provisioned for DCV sessions (Windows/Linux) are now tagged properly
  • Support for Graviton2 instances
  • Ability to disable web APIs via @disabled decorator

Changed

  • Updated EFA to 1.11.1
  • Updated Python 3.7.1 to Python 3.7.9
  • Update DCV version to 2020.2
  • Updated awscli, boto3, and botocore to support instances announced at Re:Invent 2020
  • Use new gp3 volumes instead of gp2 since they're more cost effective and provide 3000 IOPS baseline
  • Removed SchedulerPublicIPAllocation from Scheduler.template as it's no longer used
  • Updated CentOS, ALI2 and RHEL76 AMI IDs
  • Instances with NVME instance store don't become unresponsive post-restart due to filesystem checks enforcement
  • OpenSearch (formerly Elasticsearch) is now deployed in private subnets

[2.6.0] - 2020-10-29

Added

  • Users can now launch Windows instances with DCV
  • Users can now configure their DCV sessions based on their own schedule
  • Users can stop/hibernate DCV sessions
  • Users can change the hardware of their DCV sessions after the initial launch
  • Admins can create DCV AMI with pre-configured applications
  • Added support for DCV session storage. Upload/download data to SOCA directly from your DCV desktop (C:\storage-root for windows and $HOME/storage-root for linux)
  • Admins can now prevent users to download the files via the web ui
  • SOCA automatically enable/disable EFS provisioned throughput based on current I/O activity

Changed

  • Removed deprecated soca_aws_infos hook
  • Fixed an issue that caused the web interface to become unresponsive after an API reset
  • Users can now easily import/export application profiles
  • Fixed an issue that caused Nvidia Tesla drivers to be incorrectly installed on P3 instances
  • Manual_build.py now automatically upload the installer to your S3 bucket
  • Upgraded to PBS v20
  • Upgraded DCV to 2020.1-9012

[2.5.0] - 2020-07-17

Added

  • Support for Elastic MetricBeat
  • Added HTTP REST API to interact with SOCA
  • Users can now decide to restrict a job to Reserved Instances
  • Revamped Web Interface
    • Added filesystem explorer
    • Users can upload files/folders via drag & drop interface
    • Users can edit files directly on SOCA using a cloud text editor
    • Users can now manage membership of their own LDAP group via web
    • Users can now understand why they job is not started (eg: instance issue, misconfiguration, AWS limit, license limit) directly on the UI
    • Users can kill their job via the web
    • Admins can manage SOCA LDAP via web (create group, user, manage ownership and permissions)
    • Admins can creates application profiles and let user submit job via web interface
    • Ability to trigger Linux commands via HTML form
  • Admins can now limit the number of running jobs per queue
  • Admins can now limit the number of running instances per queue
  • Admins can now specify the idle timeout value for any DCV sessions. Inactive DCV sessions will be automatically terminated after this period
  • Job selection can now configured at queue level (FIFO or fair share)
  • Dry run now supports vCpus limit
  • Support for custom shells

Changed

  • Updated Troposphere to 2.6.1
  • Updated EFA to 1.9.3
  • Updated Nice DCV to 2020.0-8428
  • Updated OpenSearch (formerly Elasticsearch) to 7.4
  • You can specify a name for your DCV sessions
  • You can now specify custom AMI, base OS or storage options for your DCV sessions
  • Project assigned to DCV jobs has been renamed to "remotedesktop" (previously "gui")
  • Dispatcher script is now running every minute
  • SOCA now deploys 2 instances for OpenSearch (formerly Elasticsearch) for high availability
  • Users can now specify DEPLOYMENT_TYPE for their FSX for Lustre filesystems
  • Users can specify PerUnitThroughput when FSx for Lustre deployment type is set to PERSISTENT
  • DCV now supports G4 instance type (#24)
  • X11 is now configured correctly for ALI 3D DCV session (#23)

[2.0.1] - 2020-04-20

Added

  • Support for SpotFleet

Changed

  • NVIDIA drivers are now automatically installed when a GPU instance is provisioned
  • Deployed MATE Desktop for DCV for Amazon Linux 2

[2.0.0] - 2020-03-18

Added

  • Support for MixedInstancePolicy and InstanceDistribution
  • Support for non-EBS optimized instances such as t2
  • Integration of AWS Session Manager
  • Integration of AWS Backup
  • Integration of AWS Cognito
  • Integration of Troposphere
  • Admins can now manage ACL (individual/LDAP groups) at queue level
  • Admins can now restrict specific type/family of instance at queue level
  • Admins can now prevent users to change specific EC2 parameters
  • Users can now install SOCA using existing resources such as VPC, Security Groups ...
  • Users now have the ability to retain EBS disks associated to a simulation for debugging purposes
  • SOCA now prevent jobs to be submitted if .yaml configuration files are malformed
  • Scheduler Root EBS is now tagged with cluster ID
  • Scheduler Network Interface is now tagged with cluster ID
  • Scheduler and Compute hosts are now sync with Chrony (Amazon Time Sync)
  • Support for FSx for Lustre new Scratch2/Scratch1 and Persistent mode
  • Added Compute nodes logs on EFS (/apps/soca/<cluster_id>/cluster_node_bootstrap/logs/<job_id>//*.log) for easy debugging

Changed

  • Ignore installation if PBSPro is already configured on the AMI
  • Fixed bug when stack name only use uppercase
  • ComputeNode bootstrap scripts are now loaded from EFS
  • Users can now open a SSH session using SSM Session Manager
  • Processes are now automatically launched upon scheduler reboot
  • Max Spot price now default to the OD price
  • Default admin password now supports special characters
  • Ulimit is now disabled by default on all compute nodes
  • Dispatcher automatically append "s3://" if not present when using FSx For Lustre
  • Updated default OpenSearch (formerly Elasticsearch) instance to m5.large to support encryption at rest
  • SOCA libraries are now installed under /apps/soca/<CLUSTER_ID> location to support multi SOCA environments
  • Web UI now display the reason when a DCV job can't be submitted
  • Customers can now provision large number of EC2 hosts across multiple subnets using a single API call
  • Smart detection of Placement Group requirement when using more than 1 subnet
  • Added retry mechanism for some AWS API calls which throttled when provisioning > 1000 nodes in a single API call
  • ALB Target Groups are now correctly deleted once the DCV sessions is terminated
  • SOCA version is now displayed on the web interface
  • Updated EFA version to 1.8.3

[1.0.0] - 2019-11-20

  • Release Candidate