Currently, there are two open position:
PhD position(1): Designing Fault-Tolerant Distributed Systems for Nanosat Constellations
PhD position(2): Design of fault-tolerant distributed systems for swarms of cyber-physical systems (applications on drones/nanosats/UUVs)
PhD position: Designing Fault-Tolerant Distributed Systems for Nanosat Constellations
Keywords: orbital edge computing, distributed algorithms, cyber-physical systems, replication, distributed and resilient computing, fault tolerance, intermittent computing, nanosats' subsystems.
Context
The recent deployment of an increasing number of nanosatellites in low-earth orbit (LEO) presents new opportunities for space applications [DKL + 17, DL20]. Built atop small-sized yet powerful blocks,a.k.a. CubeSats or simply nanosats, nanosatellite constellations emerge as promising platforms for massive sensing and large-scale distributed computing. Indeed, they represent a cheaper, competitive alternative for traditional satellite systems for a wide range of application domains such as earth observation and defence.
However, the design of distributed, intelligent systems based on nanosats is particularly challenging: nanosats have more stringent physical limitations with respect to processing/networking capability, energy supply, and connectivity among nanosats. Moreover, the use of cheaper components and subsystems might expose the emerging nanosat applications to performance degradation or complex failures. Therefore, novel resilient distributed applications and protocols should be designed and evaluated to make efficient and reliable use of the resources of nanosats at the orbital edge. The proposed doctoral project aims to enable a first-of-its-kind orbital edge computing subsystems with nanosats and to design novel techniques to support reliable and efficient data processing for emerging sensing application like earth observation with the proposed orbital edge computing platform. In order to achieve this challenging goal, we will conduct interdisciplinary, collaborative research to answer the following questions:
-
How to enable distributed computing on a nanosat?
We will survey the design and implementation of state-of-the-art building blocks including suitable communication protocols and specific subsystems interfaces and abstractions for computing on nanosats. -
How to build a resilient computing system with a set of nanosats?
We will investigate distributed systems problems and propose specific solutions for dynamic reconfiguration mechanisms, consensus algorithms, and data replication schemes on nanosats systems. For that, we will take into account the ongoing research on related topics at CNES, including clock synchronization.
Proposed research
This doctoral research project aims to address the above scientific challenges as follows:
- Leveraging non-expensive, failure-prone nanosats' components. We will explore the design space and the performance evaluation of distributed system on nanosats constellations. Based on an existing, representative hardware platform proposed by the CNES, we aim to conduct a systematic study on how different choices of distributed systems primitives and designs affect the performance of key services, such as special-purpose sensing and distributed processing application. To this end, we will execute specific benchmarks to identify design opportunities, to assess the impact of different failures and to better understand the eventual trade-offs for distributed computing on nanosats.
- Resilient edge computing with constellations of nanosats. An interesting solution for processing large amount of sensing data is to build a distributed computing system with a set of nanosats. So that we will re-examine many assumptions in traditional distributed systems in the presence of processing and interconnectivity limitations of nanosats. In particular, we aim to design novel resilient applications and protocols for fault-tolerant distributed services, e.g., dynamic reconfiguration mechanisms, consensus protocols, and replication schemes. Based on these fundamental services, we will enable intelligent, distributed computing on nanosats constellations.
Currently, the availability and resilience of traditional, cloud-based distributed system are commonly guaranteed by a replication protocol based on replicated state machine (RSM). Such a protocol implements a consensus algorithm to enable strong consistency, like Fast Paxos [Lam06] and Raft [OO14]. Strongly consistent replication is key to efficient implementation of critical distributed systems’ building blocks, like distributed lock manager, reliable configuration or transactional key-value store. To our knowledge though, such protocols have never been designed and extensively evaluated on nanosatellite constellations. Finally, it is important to highlight that this doctoral project is a joint work with our colleagues in University of Adelaide in Australia, therefore the doctoral student will spend at least one year working in Australia. In addition, the doctoral candidate will have a joint doctoral degree by the end of the project.
Requirements and application
In this research project, we intend to explore both a fundamental and an applied aspects. Candidates to this position should hold a Master’s degree in Computer Science/Informatics, Mathematics or a related field by the starting date of the doctoral project. They must be excited by research in distributed systems/computing, distributed algorithms, orbital edge computing, and/or intermittent computing, and should have an excellent academic record in one of these areas. Familiarity with formal specification and verification, and graph theory/algorithms would be greatly appreciated. Teamwork and communication skills are key to this position, and industrial experience is a plus.
Knowledge of French is not required.
To apply, please send the following information to ds-resco-recruitment@lists.recherche.enac.fr(Subject=PhD position [ENAC-TESA-PhD24]: fault-tolerant distributed computing):
- Curriculum Vitæ
- Letter of motivation that should describe the applicant's background in the areas of the project, reason for interest in the project, and future plans
- A list of courses and grades of the last three years of study (an informal transcript is OK).
- Names and contact details of at least two people who can write you references, whom we will contact directly.
- If relevant, a link to your publications and/or open-source developments.
This fully-funded PhD starts in September 2024 and the duration of the contract/scholarship is 3 years.
Eligibility criteria and Benefits
Applicants of any nationality can apply, but applicants must not have a doctoral degree already or been enrolled in a PhD/doctoral program.
Benefits include:
- French government strongly subsidizes higher education, therefore the tuition fees are among the more competitive in Europe
- Doctoral students are eligible for an accommodation in our own campus which contributes significantly to reduce the total cost of living (already much cheaper than bigger European cities)
- Social security coverage included
- Subsidized meals
- Partial reimbursement of public transport costs
- Social, cultural and sports events and activities
About ENAC
The ENAC, National School of Civil Aviation, is located in Toulouse, France, the centre of the European aerospace industry (e.g., AirBus, Thales, and CNES). It offers an ideal working environment, where researchers can focus on developing new ideas, collaborations and projects.
Our research topics at ENAC Lab include emerging CPS design (e.g., drones and nanosatellites), aviation safety and security, sustainable transportation development, and aeronautical computer-human interactions. For further information, please consult our
site.
The proposed research will be developed in the ENAC research laboratory, ENAC Lab, in close cooperation with TéSA, an important industrial partner in Toulouse and the University of Adelaide in Australia.
References
[DKL + 17] K. Devaraj et al. Dove high speed downlink system. 2017.
[DL20] B. Denby and B. Lucia. Orbital edge computing: Nanosatellite constellations as a new class of computer system. In the ASPLOS, 2020.
[Lam06] L. Lamport. Fast paxos. Distributed Computing, 2006.
[OO14] D. Ongaro and . Ousterhout. In search of an understandable consensus algorithm. In the ATC, 2014.
PhD position: Design of fault-tolerant distributed systems for swarms of cyber-physical systems (applications on drones/nanosats/UUVs)
Keywords: distributed algorithms; fault tolerance; replication; parallel computing; edge computing; mobile ad hoc networks; formal specification and verification; cyber-physical systems
Context
There is an increasing interest in Cyber-Physical Systems, such
as swarm of unmanned aerial vehicles (UAV or drones) or unmanned underwater vehicles (UUV), constellations
of nano-satellites (nanosats), largely because they present
unique opportunities for real-time, more efficient applications,
including earth observation and surveillance, position tracking
and reporting, automatic identification system and situational
awareness.
In fact, CPS swarms strongly rely on the tight coupling
between the cyber components (i.e., hardware, software,
including control models) and the interactions with the physical
environment (basically through actuators and sensors) [1]. In
contrast to traditional computer systems (e.g., cloud-based
applications), the efficiency and reliability of cyber
components depend not only on the platform and the inter-process
communication, but on the intrinsically unpredictable
interactions with the dynamic environment, as well.
Although emerging CPS swarms are likely to offer highly interesting opportunities for
novel, eventually interoperable real-world services, the design
of their underlying building blocks, such as fault-tolerant
distributed computing, remains quite challenging.
Suitable fault-tolerant distributed computing algorithms are therefore required in
order to guarantee safety and high performance for emerging
distributed services on CPS swarms.
Proposed research
This research project focuses on fault-tolerant techniques and
distributed computing for achieving high-performance, reliable
computing in swarms of cyber-physical systems.
The availability and fault tolerance of a reliable distributed
system are commonly guaranteed by a replication protocol based
on replicated state machine (RSM). Such a protocol implements a
consensus algorithm, like Fast Paxos [2] and Raft [3], in order to
provide strong consistency throughout distributed, replicated
data. In fact, strongly consistent replication is key to
efficient implementation of critical distributed systems'
building blocks, like distributed lock manager or transactional
key-value store.
Strongly consistent replication has been extensively studied in
datacenter infrastructures, hence the vast majority of existing reliable
services based on this kind of replication are still cloud centric. For
instance, ZooKeeper and Chubby[4] rely on strongly consistent replication
to provide reliable lock service for large distributed systems.
Similarly, globally distributed databases, like Google Spanner, use such
a replication to support transactions throughout geo-distributed data.
Yet, strongly consistent replication is likely to be costly in
dynamically changing network topologies of swarms of CPS. Currently, it constitutes
an exciting and a promising, new topic of research. Thus, we propose to
develop and experiment novel distributed algorithms to provide fairly
efficient strongly consistent replication for mobile computing on CPS swarms.
Requirements and application
In this research, we intend to explore both a fundamental and
an applied aspects. In particular, we aim to run real experiments with
swarm of unmanned aerial vehicles (UAVs), commonly known as drones, in
the UAV experimental flight facility of our campus.
Applicants should hold a Master’s degree in Computer Science/Informatics, Mathematics or a related field by the starting date of the doctoral project. They must be excited by research in distributed systems/computing, distributed algorithms, cyber-physical systems, and/or intermittent computing, and should have an excellent academic record in one of these areas. Familiarity with formal specification and verification, and graph theory/algorithms would be greatly appreciated. Teamwork and communication skills are key to this position, and industrial experience is a plus.
Knowledge of French is not required.
To apply, please send the following information to ds-resco-recruitment@lists.recherche.enac.fr(Subject=PhD position[ENAC-MSCA-PhD24]: Distributed computing for CPS swarms):
- Curriculum Vitæ
- Letter of motivation that should describe the applicant's background in the areas of the project, reason for interest in the project, and future plans
- A list of courses and grades of the last three years of study (an informal transcript is OK).
- Names and contact details of at least two people who can write you references, whom we will contact directly.
- If relevant, a link to your publications and/or open-source developments.
This fully-funded doctoral project starts in October 2024 and the duration of the contract/scholarship is 3 years (eventually extendable to 4 years).
Eligibility criteria and Benefits
Applicants of any nationality can apply, but applicants must not have a doctoral degree already or been enrolled in a PhD/doctoral program.
Benefits include:
- French government strongly subsidizes higher education, therefore the tuition fees are among the more competitive in Europe
- Doctoral students are eligible for an accommodation in our own campus which contributes significantly to reduce the total cost of living (already much cheaper than bigger European cities)
- Social security coverage included
- Subsidized meals
- Partial reimbursement of public transport costs
- Social, cultural and sports events and activities
About ENAC research laboratory, cooperation and secondment opportunities
ENAC,
National School of Civil Aviation, is located in Toulouse, France, the centre of the European aerospace industry. Our research laboratory offer ideal working environments, where researchers can focus on developing new ideas, collaborations and projects.
The proposed research project is a joint effort between ENAC and selected partners, in France and abroad, whose common research topics include UAVs systems, sustainable
transportation development, and safety and security of cyber-physical
systems. During the project, the doctoral student will be strongly encouraged to get some secondments abroad. For further information about ENAC, please visit the following site
ENAC.
References
[1] Radhakisan Baheti and Helen Gill. “Cyber-physical systems”.
In: The impact of control technology 12.1 (2011), pp. 161–166.
[2] L. Lamport. Fast Paxos. (2006). Distributed Computing, 19(2).
[3] Ongaro, D., & Ousterhout, J. (2014). In search of an understandable
consensus algorithm. In 2014 USENIX Annual Technical Conference
(USENIX ATC 14) (pp. 305-319).
[4] Burrows, Mike. "The Chubby lock service for loosely-coupled distributed systems." Proceedings of the 7th symposium on Operating systems design and implementation. 2006.