Admin Operations Runbook
This runbook gives tenant administrators a practical operating rhythm for managing users, roles, departments, approvals, overdue work, and monthly platform health.
1. Purpose
Tenant administrators keep the platform usable, accurate, and governed. Their work is not to own every risk or control, but to ensure the right people, permissions, workflows, and data quality practices are in place.
Use this runbook for routine administration:
- onboarding and offboarding users
- assigning roles and access
- maintaining departments and ownership
- reviewing pending approvals
- monitoring overdue work
- keeping lookup values clean
- performing monthly health checks
2. Daily Admin Checklist
| Task | What to Check | Action |
|---|---|---|
| Pending approvals | Policies, assessments, exceptions, or workflow items waiting for approval. | Remind approvers or reassign if ownership changed. |
| Overdue work | Issues, findings, assessments, risk reviews, policy reviews, vendor reviews. | Escalate to owner and manager when needed. |
| Critical alerts | Critical risks, expiring exceptions, urgent findings. | Confirm owner action and management visibility. |
| Failed ownership | Records assigned to inactive or wrong users. | Reassign to the correct owner. |
| New requests | User access, role change, department change, report access. | Validate request and apply least-privilege access. |
3. Weekly Admin Checklist
| Task | What to Check | Action |
|---|---|---|
| User access review | New users, inactive users, role changes. | Remove unnecessary access and confirm business need. |
| Assessment progress | Assessments stuck, overdue, or missing evidence. | Follow up with assessment owners and control owners. |
| Risk review progress | High and critical risks approaching review date. | Ask risk owners for update or treatment decision. |
| Policy lifecycle | Drafts, pending approvals, acknowledgement campaigns. | Push policies through approval or update audience. |
| Vendor and asset hygiene | Missing owners, criticality, or review dates. | Correct ownership and required fields. |
| Scheduled reports | Recipient list and report usefulness. | Remove obsolete recipients and stale schedules. |
4. Monthly Admin Checklist
- Review dashboard and KPI trends.
- Review data quality exceptions across modules.
- Confirm high and critical risks have current treatment and review dates.
- Review overdue findings and issues with accountable owners.
- Review policy exceptions expiring within 30 days.
- Review pending policy acknowledgements.
- Review vendor assessments and high-risk vendors.
- Review inactive users and excessive privileges.
- Review scheduled report recipients.
- Prepare a short management summary with actions, blockers, and escalations.
5. User Onboarding
When adding a user:
- Confirm the business need for access.
- Confirm department, manager, and job responsibility.
- Assign the minimum role needed.
- Add module-specific access only when required.
- Confirm whether the user owns controls, risks, policies, findings, assets, or vendors.
- Notify the user of their expected responsibilities.
- Add the user to relevant assessment, policy acknowledgement, or workflow assignments.
Recommended checks:
- Do not assign administrator access for normal business ownership.
- Do not give approval permission to the same user who submits the item unless policy allows it.
- Make sure control owners can see and respond to assigned controls.
- Make sure executives have read-only access unless they also perform approvals.
6. User Offboarding
Before deactivating or removing access:
- Search for records owned by the user.
- Reassign open risks, controls, issues, findings, policies, vendors, assets, and assessments.
- Reassign pending approvals.
- Remove scheduled report delivery if no longer needed.
- Remove elevated roles first.
- Deactivate access after ownership is transferred.
- Record any unresolved ownership gaps for follow-up.
Common mistake: removing a user before reassigning ownership. This creates orphaned work and weakens reporting.
7. Assigning Roles
Use least privilege: assign the smallest role that allows the user to complete their job.
| User Type | Recommended Access |
|---|---|
| Tenant administrator | User, role, department, workflow, setup, and reporting administration. |
| Compliance manager | Assessments, controls, evidence, findings, compliance reports. |
| Risk manager | Risk register, appetite, treatments, risk reports, related issues. |
| Control owner | Assigned controls, evidence, assessment responses, remediation tasks. |
| Auditor | Audit plans, findings, evidence review, audit reports. |
| Executive viewer | Dashboards, KPIs, high-level reports, read-only views. |
| Vendor owner | Assigned vendor records, assessments, evidence, risk follow-up. |
| Policy owner | Drafting, review, acknowledgement tracking, exception review where appropriate. |
8. Departments and Ownership
Departments support filtering, reporting, and accountability.
When maintaining departments:
- keep department names consistent with the organization structure
- avoid duplicate department names with small spelling differences
- assign users to the correct department
- review department ownership when teams reorganize
- update record ownership when departments merge or split
If departments are inaccurate, reports by business area become unreliable.
9. Lookup Values and Configuration Hygiene
Lookup values should be stable and meaningful.
Review:
- risk categories
- risk treatment options
- issue priorities
- finding severities
- asset types
- vendor criticality values
- policy categories
- incident categories
- workflow statuses where configurable
Avoid creating duplicate values such as “High”, “high”, and “High Risk” for the same meaning.
10. Pending Approvals Review
For each pending approval, check:
- who submitted it
- who must approve it
- how long it has been waiting
- whether the approver is active
- whether the item has enough information for approval
- whether the decision is urgent
Escalate approvals that block assessments, policy publication, risk acceptance, or exception expiry.
11. Overdue Work Review
Overdue work should be managed from the source module.
| Overdue Item | Primary Follow-Up |
|---|---|
| Assessment | Assessment owner and control owners. |
| Control evidence | Control owner. |
| Risk review | Risk owner. |
| Treatment action | Treatment action owner. |
| Policy review | Policy owner. |
| Policy acknowledgement | Target audience owner or department manager. |
| Audit finding | Finding owner. |
| Issue | Assigned issue owner. |
| Vendor review | Vendor owner. |
Escalate repeated overdue items to the accountable manager, not only the assigned user.
12. Monthly Health Summary
A useful monthly admin summary should include:
- compliance score and major movement
- active and overdue assessments
- high and critical risks
- overdue issues and findings
- pending approvals
- expiring exceptions
- policy acknowledgement gaps
- high-risk vendors
- KPI breaches
- data quality concerns
- decisions required from management
Keep the summary action-oriented. Management should be able to see what needs a decision, who owns it, and by when.
13. Common Admin Mistakes
- granting administrator access too broadly
- leaving inactive users with ownership
- treating reports as the source of truth instead of correcting source records
- allowing overdue work without escalation
- scheduling reports without a clear audience
- publishing policies before the audience and acknowledgement requirement are correct
- closing findings or issues without evidence
- accepting risks without documented justification
- creating duplicate departments, categories, or lookup values
14. Escalation Template
Use this structure when escalating:
| Field | What to Include |
|---|---|
| Record | Name or reference of the item. |
| Owner | Current accountable owner. |
| Issue | What is overdue, incomplete, blocked, or high risk. |
| Impact | Compliance, risk, audit, operational, or reporting impact. |
| Required Decision | What management must decide or approve. |
| Due Date | When action is required. |
| Recommended Next Step | Reassign, approve, remediate, accept, extend, or close. |
15. Production Deployment
Before deployment:
- Confirm the release tag, commit SHA, and approved change ticket.
- Confirm database backup completed successfully.
- Confirm required secrets and environment variables are present for the target environment.
- Confirm
ASPNETCORE_ENVIRONMENT=Production. - Confirm allowed CORS origins match the production frontend domain.
- Confirm SMTP settings are valid.
- Confirm DataProtection keys are mounted to persistent storage or an approved key store.
- Run the build and test commands from the production action plan.
Deployment sequence:
- Put the deployment in the maintenance window or announce the no-downtime release.
- Deploy backend and frontend artifacts.
- Run pending database migrations.
- Restart backend workers and API instances.
- Verify
/healthor the configured health endpoint. - Sign in as a platform administrator and tenant administrator.
- Run the smoke workflow: login, dashboard load, tenant list or tenant dashboard, one list export, one report export.
- Monitor logs, error reports, and failed background jobs for at least 30 minutes.
16. Rollback
Rollback is required when deployment causes login failure, data corruption risk, unavailable critical workflows, or broken executive reporting.
Rollback sequence:
- Stop new deployments and capture current logs.
- Identify whether rollback is application-only or requires database restore.
- If no destructive migration ran, redeploy the previous backend and frontend artifacts.
- If a destructive or incompatible migration ran, restore the approved database backup to a replacement database and point the app to it.
- Restore the previous environment variables if they changed.
- Restart services and verify health checks.
- Run smoke checks and confirm with the business owner.
- Record the incident, root cause, customer impact, and follow-up fix.
17. Database Backup
Minimum backup controls:
- run automated daily full backups
- run point-in-time recovery if supported by the database platform
- encrypt backups at rest
- store backups outside the application host
- restrict restore permissions to operations owners
- test restore at least monthly
Before high-risk operations:
- Take an on-demand backup.
- Record backup name, timestamp, database name, and operator.
- Confirm backup completion and size.
- Keep the backup until the release is accepted.
18. Database Restore
Restore sequence:
- Confirm restore approval and target recovery point.
- Restore to a separate database first.
- Run schema compatibility checks against the deployed application version.
- Validate tenant count, admin users, critical records, and recent audit logs.
- Switch the application connection string only after validation.
- Restart application services.
- Run smoke checks and document the restored timestamp.
Never overwrite the production database before validating the backup in an isolated target.
19. Migration Execution
Before migrations:
- Review migration files and identify destructive operations.
- Confirm backup is complete.
- Confirm no long-running imports, exports, or scheduled reports are active.
- Run migrations once per environment from the approved deployment job.
After migrations:
- Confirm migration history table includes the expected migration.
- Confirm backend starts without schema errors.
- Run login, dashboard, list, report, and export smoke checks.
- Keep migration output with the release evidence.
20. DataProtection Key Backup
DataProtection keys protect cookies and tokens. Losing them can invalidate sessions and break encrypted payloads.
Required controls:
- store keys in persistent storage or a managed key store
- back up the key location with production backups
- restrict write access to the application identity and operations owners
- do not rotate by deleting old keys
- verify all backend instances share the same key ring
Recovery:
- Restore the key ring before restarting replacement application instances.
- Confirm users can sign in without forced global logout unless key loss is accepted.
- If keys are lost, announce session invalidation and require users to sign in again.
21. Log Retention
Minimum retention:
| Log Type | Minimum Retention |
|---|---|
| Application logs | 90 days |
| Security and audit logs | 1 year |
| Deployment logs | 1 year |
| Database backup and restore logs | 1 year |
| Error reports | 180 days |
Operational checks:
- confirm logs include environment, tenant, user id where safe, correlation id, and timestamp
- avoid storing passwords, tokens, secrets, or unnecessary personal data
- review critical errors daily during launch and weekly after stabilization
- preserve logs for open incidents until closure
22. Incident Response
Use this process for outages, security events, data integrity issues, failed deployments, and critical reporting failures.
- Declare severity and incident owner.
- Preserve logs, deployment version, database state, and screenshots.
- Stabilize service: rollback, disable feature, scale, or restore as needed.
- Communicate impact, workaround, and next update time.
- Track timeline, decisions, and actions.
- Confirm recovery with smoke checks and business owner validation.
- Complete root-cause analysis.
- Add preventive action to the backlog and assign an owner.
23. Secret Rotation
Rotate secrets when scheduled, when an employee with access leaves, after suspected exposure, or before customer launch if test secrets were used.
Rotation sequence:
- Identify dependent services and rollback plan.
- Create the new secret in the approved secret store.
- Update application configuration without exposing the value in logs.
- Restart affected services.
- Validate login, email, database, storage, webhooks, and SSO integrations as applicable.
- Revoke the old secret after validation.
- Record the rotation date, operator, systems affected, and next rotation due date.
24. Tenant Onboarding
Tenant onboarding sequence:
- Confirm customer name, tenant code, domain, admin contact, support contact, and go-live date.
- Create or onboard the tenant from the platform tenant administration screen.
- Configure first administrator and require password change or invitation acceptance.
- Enable only purchased or approved modules.
- Sync tenant permissions and workflows.
- Configure SMTP sender, branding, CORS/domain settings, and documentation URL if tenant-specific.
- Seed approved frameworks, roles, and starter data only when accepted.
- Run tenant smoke checks: admin login, dashboard, user invite, one module list, one report.
- Record launch approval and accepted risks.
25. Tenant Offboarding
Tenant offboarding sequence:
- Confirm commercial, legal, and data-retention approval.
- Disable tenant access or put the tenant in suspended state.
- Export customer-requested records where contractually required.
- Preserve audit logs and backups for the agreed retention period.
- Remove scheduled reports, webhooks, and external integrations.
- Delete or anonymize data only after legal approval and backup-retention review.
- Record final status, retained data, deletion date, and approver.
26. Support Impersonation
Support impersonation is high risk and must be exceptional.
Required controls:
- use only approved platform administrator accounts
- require a support ticket, customer approval, reason, tenant, user, and time window
- grant the minimum duration needed
- record every impersonation start and end in audit logs
- never change customer data unless the ticket explicitly authorizes it
- never use impersonation to bypass segregation of duties
Impersonation closure:
- End impersonation immediately after the support action.
- Record actions taken and records viewed or changed.
- Attach evidence to the support ticket.
- Ask the customer administrator to verify the issue is resolved.
27. Customer Launch Checklist
Complete this checklist before giving a customer production access.
| Area | Required Check | Evidence |
|---|---|---|
| Domain and TLS | Production domain resolves correctly and TLS certificate is valid. | Domain, certificate expiry date, verification screenshot. |
| SMTP | Production SMTP sender works and SPF/DKIM/DMARC are aligned where applicable. | Test email, sender domain, delivery result. |
| CORS | Backend allows only approved production frontend origins. | Environment variable or configuration screenshot. |
| Backups | Automated backups are enabled and one restore test has passed. | Backup job name, latest backup timestamp, restore test date. |
| Monitoring | Health checks, application logs, error reports, and alert recipients are configured. | Monitoring dashboard URL and alert recipient list. |
| Admin accounts | Platform and tenant admin accounts are named users, active, and protected. | Account list and owner approval. |
| Seed data | Seeded frameworks, roles, users, and starter records are approved for this customer. | Seed data approval or "none" confirmation. |
| First tenant | Tenant code, display names, modules, workflows, and permissions are correct. | Tenant detail screenshot or checklist note. |
| Support contacts | Customer admin, technical contact, escalation contact, and support mailbox are recorded. | Contact list. |
| Docs URL | Customer-facing documentation URL is configured and reachable. | URL and access check. |
| Legal and privacy notice | Privacy notice, terms, data processing, and retention positions are accepted. | Legal approval reference. |
| Accepted risks | Any known production limitations are documented and accepted by product, security, and operations owners. | Accepted-risk register or signed launch note. |
Launch approval:
- Product owner confirms user-facing workflow readiness.
- Security owner confirms access, tenant isolation, secrets, and logging readiness.
- Operations owner confirms deployment, backup, restore, monitoring, and support readiness.
- Customer owner confirms first tenant, admin users, contacts, and go-live timing.
- Release owner records the final go/no-go decision.