Cloud or On-Premise? A Strategic View of Large Language Model Deployment
| dc.contributor.author | Shi, Jiaqi | |
| dc.contributor.author | Zhang , Zhoupeng (Jack) | |
| dc.contributor.author | Tang, Shaojie | |
| dc.date.accessioned | 2025-12-23T16:35:23Z | |
| dc.date.available | 2025-12-23T16:35:23Z | |
| dc.date.issued | 2026-01-06 | |
| dc.description.abstract | Large language models (LLMs) have advanced rapidly in recent years. We examine a critical decision faced by an LLM provider: whether to provide a local (on-premise) service channel in addition to cloud services. We develop a game-theoretical queueing model to analyze the economic and welfare implications of introducing an on-premise model. Our results show that offering the localization option can reduce the provider's optimal profit due to market cannibalization, yet increase users' overall surplus. Such market outcomes can be reinforced by users' privacy concerns, but may reverse when users differ significantly in their service valuations, as localization enables the provider to extract users' surplus more effectively. When localization is offered through a third party, price discrimination can further increase surplus extraction; however, the double marginalization along the AI supply chain may offset these gains. Finally, in competitive markets, localization may prompt an entrant to lower the quality of their cloud services to limit cannibalization, thereby softening price competition with the incumbent to some extent. Overall, our analysis highlights the strategic trade-offs in LLM deployment and provides guidance on pricing and localization decisions. | |
| dc.format.extent | 10 pages | |
| dc.identifier.doi | https://doi.org/10.24251/HICSS.2026.121 | |
| dc.identifier.isbn | 978-0-9981331-9-5 | |
| dc.identifier.other | 89d4a3e8-15d0-4b2e-ac96-8162cb8482fc | |
| dc.identifier.uri | https://hdl.handle.net/10125/111515 | |
| dc.language.iso | eng | |
| dc.relation.ispartof | Proceedings of the 59th Hawaii International Conference on System Sciences | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject | AI, Platforms, and Ecosystems in Digital Services | |
| dc.subject | generative ai economics | |
| dc.subject | large language models | |
| dc.subject | on-premise deployment | |
| dc.subject | pricing strategies | |
| dc.title | Cloud or On-Premise? A Strategic View of Large Language Model Deployment | |
| dc.type | Conference Paper | |
| dc.type.dcmi | Text | |
| prism.startingpage | 1010 |
Files
Original bundle
1 - 1 of 1
