Optimization of protocols
| dc.contributor.advisor | Pavlovic, Dusko | |
| dc.contributor.author | Hernandez, Oscar Ivan | |
| dc.contributor.department | Computer Science | |
| dc.date.accessioned | 2025-09-30T22:32:20Z | |
| dc.date.available | 2025-09-30T22:32:20Z | |
| dc.date.issued | 2025 | |
| dc.description.degree | M.S. | |
| dc.identifier.uri | https://hdl.handle.net/10125/111289 | |
| dc.subject | Computer science | |
| dc.subject | Artificial intelligence | |
| dc.subject | Information technology | |
| dc.subject | artificial intelligence | |
| dc.subject | induction | |
| dc.subject | multi-agent system | |
| dc.subject | post-training | |
| dc.subject | protocol | |
| dc.subject | security | |
| dc.title | Optimization of protocols | |
| dc.type | Thesis | |
| dcterms.abstract | Advances in the capabilities of artificial intelligence bring increased participation of intelligent machines in the protocols of computer networks and society, where they play some roles earmarked for machines and others ripe for deception. This exacerbates existing concerns, and it introduces a new dimension to the problems of privacy and security. Whereas a cryptographic protocol can be analyzed with formal methods in terms of the properties of the traces it produces, probabilistic protocols involving potentially deceitful AI participants are analyzed in terms of probability distributions over their traces. In contrast with formal specifications of explicit requirements, the requirements for such AI protocols are specified informally and implicitly by reward models trained on data. A mathematical model of protocol post-training is proposed in terms of an objective function defined by such rewards and regularized by statistical distances from the pre-trained behaviors. It is shown that any instance of such a protocol post-training problem admits solutions at a level of generality that does not depend on particular details of algorithms or computational paradigms. The result shows the existence of the optimal behaviors that learning algorithms aim to represent, and it applies to reinforcement learning algorithms as well as to algorithms in any other paradigm of learning. This establishes the proposed model of protocol post-training as a general setting for reasoning about the opportunities and limitations of protocols involving AI actors. | |
| dcterms.extent | 79 pages | |
| dcterms.language | en | |
| dcterms.publisher | University of Hawai'i at Manoa | |
| dcterms.rights | All UHM dissertations and theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission from the copyright owner. | |
| dcterms.type | Text | |
| local.identifier.alturi | https://www.proquest.com/LegacyDocView/DISSNUM/32121122 |
Files
Original bundle
- Name: Hernandez_hawii_0085O_12697.pdf
- Size: 784.06 KB
- Format: Adobe Portable Document Format
