{"id":1058,"date":"2025-10-07T13:39:22","date_gmt":"2025-10-07T17:39:22","guid":{"rendered":"https:\/\/carleton.ca\/rcs\/?page_id=1058"},"modified":"2026-06-15T15:22:38","modified_gmt":"2026-06-15T19:22:38","slug":"llm-access","status":"publish","type":"page","link":"https:\/\/carleton.ca\/rcs\/llm-access\/","title":{"rendered":"LLM Access"},"content":{"rendered":"\n<section class=\"w-screen px-6 cu-section cu-section--white ml-offset-center md:px-8 lg:px-14\">\n    <div class=\"space-y-6 cu-max-w-child-5xl  md:space-y-10 cu-prose-first-last\">\n\n            <div class=\"cu-textmedia flex flex-col lg:flex-row mx-auto gap-6 md:gap-10 my-6 md:my-12 first:mt-0 max-w-5xl\">\n        <div class=\"justify-start cu-textmedia-content cu-prose-first-last\" style=\"flex: 0 0 100%;\">\n            <header class=\"font-light prose-xl cu-pageheader md:prose-2xl cu-component-updated cu-prose-first-last\">\n                                    <h1 class=\"cu-prose-first-last font-semibold !mt-2 mb-4 md:mb-6 relative after:absolute after:h-px after:bottom-0 after:bg-cu-red after:left-px text-3xl md:text-4xl lg:text-5xl lg:leading-[3.5rem] pb-5 after:w-10 text-cu-black-700 not-prose\">\n                        LLM Access\n                    <\/h1>\n                \n                                \n                            <\/header>\n\n                    <\/div>\n\n            <\/div>\n\n    <\/div>\n<\/section>\n\n\n\n<p>The Research and Computing Services created a Large Language Model server to provide access to the Carleton University community. REST API endpoints can be accessed upon request. There are many models deployed in this server, including:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>all-minilm:33m<\/li>\n\n\n\n<li>all-minilm:latest<\/li>\n\n\n\n<li>command-r-plus:latest<\/li>\n\n\n\n<li>command-r:latest<\/li>\n\n\n\n<li>embeddinggemma:latest<\/li>\n\n\n\n<li>gemma3:27b<\/li>\n\n\n\n<li>gemma4:26b<\/li>\n\n\n\n<li>gemma4:31b<\/li>\n\n\n\n<li>gemma4:latest<\/li>\n\n\n\n<li>gpt-oss:120b<\/li>\n\n\n\n<li>gpt-oss:20b<\/li>\n\n\n\n<li>llama3.1:latest<\/li>\n\n\n\n<li>llama3.2-vision:90b<\/li>\n\n\n\n<li>llama3.2:latest<\/li>\n\n\n\n<li>llama3.3:latest<\/li>\n\n\n\n<li>llama4:maverick<\/li>\n\n\n\n<li>llama4:scout<\/li>\n\n\n\n<li>mistral-large:latest<\/li>\n\n\n\n<li>mixtral:8x22b<\/li>\n\n\n\n<li>mxbai-embed-large:latest<\/li>\n\n\n\n<li>qwen2.5:latest<\/li>\n\n\n\n<li>qwen3-coder-next:q8_0<\/li>\n\n\n\n<li>qwen3-embedding:latest<\/li>\n\n\n\n<li>qwen3.5:122b<\/li>\n\n\n\n<li>qwen3.5:27b<\/li>\n\n\n\n<li>qwen3.5:35b<\/li>\n\n\n\n<li>qwen3.5:9b<\/li>\n\n\n\n<li>qwen3:235b<\/li>\n\n\n\n<li>qwen3:32b<\/li>\n\n\n\n<li>qwen3:8b<\/li>\n\n\n\n<li>translategemma:latest<\/li>\n<\/ul>\n\n\n\n<p>The backend uses Ollama to deploy models. You can use the instructions from&nbsp;<a href=\"https:\/\/docs.ollama.com\/quickstart\">Ollama docs<\/a>, but provide the variable \u201cx-api-key\u201d with your RCS API Key, which can be requested&nbsp;<a href=\"https:\/\/carleton.ca\/rcs\/llm-access\/\">here<\/a>. If approved, you will receive information about the host and API Key.<\/p>\n\n\n\n<p><strong class=\"myprefix-text-bold\">The code below is an example of access using the Ollama package in Python:<\/strong><\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>from ollama import Client\nfrom argparse import ArgumentParser\n\nDEFAULT_HOST = \u201chttps:\/\/rcsllm.carleton.ca\/rcsapi\u201d\nparser = ArgumentParser(description=\u201dOllama API Client Example\u201d)\nparser.add_argument(\u201c\u2013host\u201d, type=str, help=\u201dOllama server host\u201d, required=False, default=DEFAULT_HOST)\nparser.add_argument(\u201c\u2013model\u201d, type=str, help=\u201dModel to use for requests\u201d, required=False, default=\u201dgpt-oss:120b\u201d)\nparser.add_argument(\u201c\u2013prompt\u201d, type=str, help=\u201dPrompt to send to the model\u201d, required=False, default=\u201dWhat is an LLM and how are you compared to other models?\u201d)\nparser.add_argument(\u201c\u2013stream\u201d, action=\u201dstore_true\u201d, help=\u201dStream the response\u201d, required=False, default=True) parser.add_argument(\u201c\u2013api_key\u201d, type=str, help=\u201dAPI Key for authentication\u201d, required=True)\nparser.add_argument(\u201c\u2013list_models\u201d, action=\u201dstore_true\u201d, help=\u201dList available models\u201d, required=False, default=False)\n\nif __name__ == \u201c__main__\u201d:\n    args = parser.parse_args()\n    api_key = args.api_key\n    custom_header = {\u201cx-api-key\u201d: api_key} if api_key else {}\n    ollama_client = Client(host=args.host, headers=custom_header)\n    if args.list_models:\n        models = ollama_client.list()\n        print(\u201cAvailable models:\u201d)\n        for model in models.models:\n            print(f\u201d- {model.model}\u201d)\n        exit(0)\n    stream = args.stream\n    response = ollama_client.chat(\n        model=args.model,\n        messages=&#91;\n            {\u201crole\u201d: \u201cuser\u201d, \u201ccontent\u201d: args.prompt} ],\n        stream=stream ) if stream: for chunk in response: print(chunk&#91;\u2018message\u2019]&#91;\u2018content\u2019], end=\u201d\u201d, flush=True) else: print(response&#91;\u2018message\u2019]&#91;\u2018content\u2019])<\/code><\/pre>\n\n\n\n<p>If you wish to get access to this resource, fill out the form available&nbsp;<a href=\"https:\/\/itsjira.carleton.ca\/servicedesk\/customer\/portal\/5\/create\/893\">here<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Research and Computing Services created a Large Language Model server to provide access to the Carleton University community. REST API endpoints can be accessed upon request. There are many models deployed in this server, including: The backend uses Ollama to deploy models. You can use the instructions from&nbsp;Ollama docs, but provide the variable \u201cx-api-key\u201d [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_cu_dining_location_slug":"","footnotes":"","_links_to":"","_links_to_target":""},"cu_page_type":[],"class_list":["post-1058","page","type-page","status-publish","hentry"],"acf":{"cu_post_thumbnail":""},"_links":{"self":[{"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/pages\/1058","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/comments?post=1058"}],"version-history":[{"count":5,"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/pages\/1058\/revisions"}],"predecessor-version":[{"id":1200,"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/pages\/1058\/revisions\/1200"}],"wp:attachment":[{"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/media?parent=1058"}],"wp:term":[{"taxonomy":"cu_page_type","embeddable":true,"href":"https:\/\/carleton.ca\/rcs\/wp-json\/wp\/v2\/cu_page_type?post=1058"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}