
Modern supercomputers such as Argonne's Aurora Exascale machine are large and complex, and comprise so many components that a small failure rate per component translates into an appreciable failure rate for the entire machine. In order for applications to be made resilient in the face of such unavoidable failures, it is necessary to characterize the application failure rate with good precision, and if possible with some specificity regarding the type and hardware usage pattern of applications. This requires combining of various streams of logging information, and nontrivial statistical analysis, possibly supplemented by machine learning techniques. This project targets the study of application failure rates on Aurora, based on such analyses. The student will explore the log data streams to discover effective ways of tracking application failure rates, write code to analyze, categorize, and visualize application interrupts, and help develop models of machine computational efficiency.
Education and Experience Requirements
The entirety of the appointment must be conducted within the United States.
• Applicants must be:
o Currently enrolled in undergraduate or graduate studies at an accredited
institution.
o Graduated from an accredited institution within the past 3 months; or
o Actively enrolled in a graduate program at an accredited institution.
• Must be 18 years or older at the time the appointment begins.
• Must possess a cumulative GPA of 3.0 on a 4.0 scale.
• Must be a U.S. citizen or Legal Permanent Resident at the time of application.
• If accepting an offer, candidates may be required to complete pre-employment drug testing based on appointment length. All students remain subject to applicable drug testing policies.
Job Family
DOE Seasonal Intern
Job Profile
DOE - SULI (Science Undergraduate Laboratory Internship)
Worker Type
Contingent Worker
Time Type
Full time
Scheduled Weekly Hours
40
EEO Information
As an equal employment opportunity employer, and in accordance with our core values of impact, safety, respect, integrity and teamwork, Argonne National Laboratory is committed to a safe and welcoming workplace that fosters collaborative scientific discovery and innovation. Argonne encourages everyone to apply for employment. Argonne is committed to nondiscrimination and considers all qualified applicants for employment without regard to any characteristic protected by law.
Argonne employees, and certain guest researchers and contractors, are subject to particular restrictions related to participation in Foreign Government Sponsored or Affiliated Activities, as defined and detailed in United States Department of Energy Order 486.1A. You will be asked to disclose any such participation in the application phase for review by Argonne's Legal Department.

Argonne National Laboratory, one of the U.S. Department of Energy's national laboratories for science and engineering research, employs 3,400 employees, including 1,400 scientists and engineers, three-quarters of whom hold doctoral degrees. Argonne's annual operating budget of around $1 billion supports upwards of 200 research projects. Since 1990, Argonne has worked with more than 600 companies and numerous federal agencies and other organizations.
Argonne's mission is to apply a unique mix of world-class science, engineering and user facilities to deliver innovative research and technologies. We create new knowledge that addresses the most important scientific and societal needs of our nation.
We actively seek opportunities to work with industry to transfer our technologies to the marketplace through licensing, joint research and many other collaborative relationships.
Argonne is managed by UChicago Argonne, LLC, for the U.S. Department of Energy's Office of Science. We are located on 1,500 acres (6.9 sq. km) in southwest DuPage County, Illinois 25 miles (40 km) southwest of Chicago. The site is completely encircled by the beautiful Waterfall Glen Forest Preserve.