Base Test Stand¶
This section contains site specific variations for the Base test stand.
Resources¶
- LOVE: http://love01.ls.lsst.org
- Argo CD: https://base-lsp.lsst.codes/argo-cd
- Chronograf: https://base-lsp.lsst.codes/chronograf
- Nublado: https://base-lsp.lsst.codes/
- Rancher: https://rancher.ls.lsst.org (1)
- Slack: #rubinobs-base-teststand
(1) Need to get kubeconfig file from here. File a Jira ticket with IT for access. Once able to log into Rancher:
- Select the manke cluster.
- Click the Kubeconfig File button in top-right.
- Near bottom of dialog, click the download link.
Non-Production Systems¶
The Base Teststand operates all CSCs and systems on the production domain.
Bare Metal Machines¶
- Main OSPL Daemon: azar01.ls.lsst.org
- LOVE: love01.ls.lsst.org
- T&S CSCs: tel-hw1.ls.lsst.org
- Kubernetes: Can be done from own machine, just need kubeconfig file and kubectl installed.
- Systems run on the manke cluster.
- Can also use: https://k8slens.dev/.
- ATCamera (Tony Johnson): auxtel-mcm.ls.lsst.org
LOVE Summary View¶
The overall system summary state view is called Summary State
.
Checking the Number of Federations¶
This uses a script in https://github.com/lsst-ts/k8s-admin. Run ./feds-check from a machine with kubectl and the proper kubeconfig file.
Shutdown Camera Services¶
- Shutdown ATCamera OCS Bridges:
- sudo systemctl stop ats-ocs-bridge.service
- Shutdown Camera Daemons
- sudo systemctl stop opensplice.service
Shutdown LOVE¶
This needs to be done from love01.
- Uses the
docker-compose-admin
scripts inbase-teststand/love01
directory, which are are linked into the dco user home directory. - Become the dco user: sudo -iu dco
- ./shutdown_love
- ./shutdown_daemon
- Uses the
Shutdown T&S Bare Metal Services¶
Handle tel-hw1:
- Uses the
docker-compose-admin
scripts inbase-teststand/tel-hw1
directory, which are are linked into the dco user home directory. - Become the dco user: sudo -iu dco
- ./shutdown_atmcs_atp
- ./shutdown_daemon
- Uses the
Interacting with Kubernetes¶
Commands can be executed from your own machine with kubectl and the manke.yaml kubeconfig file.
Download from https://rancher.ls.lsst.org/dashboard
Shutdown Main Daemon¶
This needs to be done from azar01.
- Uses the
docker-compose-admin
scripts inbase-teststand/azar01
, which are are linked into the dco user home directory. - Become the dco user: sudo -iu dco
- ./shutdown_daemon
- Uses the
Update Configuration¶
- Gather the branch for the configurations and version number for
ts_ddsconfig
. - Uses the
docker-compose-admin/base-teststand/update_repo
script, which is linked into the dco user home directory. - Repos to update:
docker-compose-ops
(azar01, love01, tel-hw1)LOVE-integration-tools
(love01)ts_ddsconfig
(love01, tel-hw1) NOTE: Only necessary if there are updates.
- Become the dco user: sudo -iu dco
- ./update_repo <repo path> <branch or version>
Startup Main Daemon¶
This needs to be done from azar01.
- Uses the
docker-compose-admin
scripts inbase-teststand/azar01
directory, which are linked into the dco user home directory. - Become the dco user: sudo -iu dco
- ./launch_daemon
- Ensure daemon is ready before proceeding.
- Uses the
Startup Minimal Kubernetes System¶
This replaces most of step 6.3 in the main document. Follow the first three bullet points in that step and then continue the process with the next steps.
- *python sync_apps.py -p
- csc-cluster-config and ospl-config apps will be synced automatically.
- Once the ospl-daemon app is synced, the script will pause.
- Check the logs on Argo CD UI to see if daemons are ready.
- Type
go
and enter to move onto syncing the kafka-producers app. - Script will again pause once the kafka-producers are synced.
- The kafka-producers use a startup probe, so once all of the pods show a green heart, stop here and return to step 6.4 in the main document.
- Make sure you leave the script running.
Startup LOVE¶
This needs to be done from love01.
- Uses the
docker-compose-admin
scripts inbase-teststand/love01
directory, which are linked into the dco user home directory. - Become the dco user: sudo -iu dco
- ./launch_daemon
- Ensure daemon is ready before proceeding.
- ./launch_love
- Uses the
Startup Camera Services¶
This needs to be done from auxtel-mcm.
- Start Camera Daemons
- sudo systemctl start opensplice.service
- Start Camera OCS Bridges:
- ATCamera: sudo systemctl start ats-ocs-bridge.service
- Ensure bridge services are running:
- ATCamera: sudo systemctl status ats-ocs-bridge.service
- Transition to OFFLINE_AVAILABLE:
- ATCamera:
- ccs-shell
- ccs> set target ats-ocs-bridge
- ccs> lock
- ccs> setAvailable
- ccs> unlock
- ccs> exit
Startup T&S Bare Metal Services¶
Handle tel-hw1
- Uses the
docker-compose-admin
scripts inbase-teststand/tel-hw1
directory, which are linked into the dco user home directory. - Become the dco user: sudo -iu dco
- ./launch_daemon
- Ensure daemon is ready before proceeding.
- ./launch_atmcs_atp
- Uses the
Enabled CSCs¶
If proceeding with integration testing, the CSCs will be brought to ENABLED state as part of that process. All of the startup processes may be necessary for recovering the BTS from any maintenance. The following components will automatically transition to ENABLED state when launched:
- WeatherForecast
- ScriptQueue:1
- ScriptQueue:2
- DSM:1
- DSM:2
For the other components, the BTS will be handled in the same way as the Summit. For reference, see Observatory and Control System Guidelines for BTS
Only leverage the following scripts, if necessary. Required configurations will be given for each script execution.
set_summary_state.py
data: - [ESS:1, ENABLED] - [ESS:101, ENABLED] - [ESS:102, ENABLED] - [ESS:103, ENABLED] - [ESS:104, ENABLED] - [ESS:105, ENABLED] - [ESS:201, ENABLED] - [ESS:202, ENABLED] - [ESS:203, ENABLED] - [ESS:204, ENABLED] - [ESS:205, ENABLED] - [ESS:301, ENABLED] - [Watcher, ENABLED]