Infrastructure Specialist

TELESAT

Ontario, Canada

Full time

IT / System / Network

{{field.value|getBooleanValue}}

Aug 28

MAIN RESPONSIBILITIES:

  • Oversee and manage operations of a high-performance compute cluster (HPC) utilizing Linux and Slurm.
  • Oversee and manage the infrastructure for high density CPU intensive clusters.
  • Design and build highly available server/storage clusters and provide recommendations and feedback to the project team for IT infrastructure based on software requirements.
  • Deployment of server infrastructure including hypervisors and connected datastores. Configuration of server and storage network parameters.
  • Deployment and management of related storage, hypervisor datastores, replication and backups.
  • Performing installations, customizations and patching of hypervisors, operating systems, security, and other operations tooling.
  • Establishment and set up of hardware monitoring and alerting.
  • Building out basic infrastructure services (e.g., DNS, DHCP, NTP, SMTP).
  • Collaborate with the Facilities teams to ensure that all Telesat requirements meet business needs, and to ensure that power and cooling are accurately specified to meet required specifications and availability.
  • Implement and collaborate with stakeholders to ensure that all physical and virtual access to assets are secured and follow appropriate security processes and standards.
  • Engage with the Network teams to ensure both LAN connectivity meets availability and bandwidth requirements.
  • Work with the Network team to ensure cabling is managed and tracked within the data centres.
  • Deployment of new server, storage and network infrastructure as required. This includes the racking and stacking and building of the compute and storage environments.
  • Connection and testing of rack power and network cabling.
  • Document, track, and monitor problems to ensure timely resolution.
  • Document operating processes, runbooks and build books for data centre operations.
  • Assist in audits and forensics on artifacts collected during security incidents.
  • Perform occasional after-hours maintenance on production systems.
  • Incident on-call rotation as required.
  • Day-to-day operational support. 

EDUCATION & EXPERIENCE REQUIRED

  • A Diploma or Degree in a relevant area of study with a preference for Computer Science together with demonstrated operational network-related experience.
  • Minimum of 4 years in supporting IT infrastructure in an Engineering environment.
  • Industry certifications in VMware, Windows and/or Linux would be an asset.
  • Significant experience with x86 compute hardware and current enterprise storage systems.
  • Significant and recent experience with Linux (e.g., RedHat, Ubuntu) in an enterprise environment.
  • Significant and recent experience with Windows Server (2016+) in an enterprise environment.
  • Significant and recent experience with VMware vSphere and vCenter.
  • Significant experience in scripting (e.g., PowerShell, bash).
  • Working technical knowledge of network systems.
  • Working technical knowledge of systems software, protocols and standards including Active Directory.
  • Working knowledge of utilities and tools to support and monitor servers and storage devices.
  • Working knowledge of the installation and operation of Kubernetes (k8s).
  • Working knowledge of security principles and securing systems.
  • Excellent written and oral communication skills.
  • Excellent problem-solving skills.
  • Strong interpersonal and organizational skills.
  • Ability to speak effectively before groups of internal employees, communicate technical information, create, and deliver presentations and information sessions to both technical and nontechnical personnel.
  • Demonstrated experience in applying technical expertise and in-depth evaluation to solve complex problems in own area of expertise.
  • Bilingual (English/French) is an asset.

DECISION MAKING & SUPERVISION:

  • Make decisions and recommendations based on Lightspeed compute requirements within scope.
  • Work under minimum supervision.

WORKING CONDITIONS:

  • Generally comfortable working conditions with lifting and onsite installations.
  • Lifting and moving of IT equipment.
  • Occasional travel within Ontario/Quebec may be required from time to time (15%).
  • Moderate visual concentration in use of video display terminal.
  • Appropriate security clearances required.
  • Occasional off-hours support may be required.
  • Participation in group pager rotation (‘on call’) as required

  • The successful candidate must be able to work in Canada and obtain clearance under the Canadian Controlled Goods program (CGP).

At Telesat, we take pride in being an equal opportunity employer that values equality in the workplace.  We are committed to providing the best candidate experience possible including any required accommodations at every stage of our interview process.  All qualified applicants that have been selected for an interview that require accommodations, are advised to inform the Telesat Talent team accordingly. We will work with you to meet your needs.  All accommodation information provided will be treated as confidential.

Apply for this position Back to job

You must be logged in to apply to this job.

{{notification.msg}}