Skip to content

Python backend that processes university data into text embeddings using the OpenAI API and provides a search API for finding relevant results.

Notifications You must be signed in to change notification settings

Scholar-Compass/Scholar-Compass-backend

Repository files navigation

Scholar Compass' Backend

流程

准备数据

flowchart TB
飞书[飞书<br>各校文档] -->|导出| Word -->|转换| md_uni/*.md
飞书 -->|开头加链接| md_uni/*.md
人工 --> md_other/常见问题汇总.md
Markdown ==>|<code>md_to_csv.py</code><br>转换格式| CSV

md_uni/*.md -.-> csv_to_embed/*.csv
md_other/常见问题汇总.md -.-> csv_to_embed/*.csv
md_uni/*.md -.-> csv_other/links.csv

subgraph Markdown
    direction TB
    md_uni/*.md
    md_other/常见问题汇总.md
end

subgraph CSV
    direction TB
    csv_other/links.csv[csv_other/links.csv<br>各校文档的链接]
    csv_to_embed/*.csv[csv_to_embed/*.csv<br>每一节对应的各级标题和学校]
end

csv_to_embed/*.csv ==>|<code>embedding.py</code><br>via OpenAI API| embedding/*.csv[embedding/*.csv<br>text + embedding]

csv_other/links.csv  --> 后续
embedding/*.csv  --> 后续([后续使用])
Loading

另外,用read_csv.py可检视生成的embedding/*.csv

后端

flowchart LR
前端 -->|"POST <code>/query</code><br><code>{ question: string }</code>"| api.py[<code>api.py</code><br>Flask]
api.py -->|"<code>{ answer: string }</code>"| 前端
api.py <-->|<code>search.py</code>| OpenAI[OpenAI API]
Loading

search.py从 OpenAI 生成答案步骤如下。

  1. ask()

    [!WARNING]

    ask()history参数似乎实际未使用。

    1. query_message()

      1. strings_ranked_by_relatedness()根据输入的问题,利用openai.Embedding选出相关学校。
      2. 在问题之前补充“大学信息”等提示。
    2. openai.ChatCompletion提问并返回答案。

  2. add_link()

    csv_other/links.csv匹配校名,添加相应链接。

About

Python backend that processes university data into text embeddings using the OpenAI API and provides a search API for finding relevant results.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages