To train OceanGPT(沧渊), we collected an ocean science corpus that spans multiple fields. Since each subfield and topic has its unique data characteristics and patterns, we proposed a domain-specific instruction generation framework called DoInstruct. We trained OceanGPT based on open-source models (such as Qwen, LLaMA, MiniCPM, etc.).
Disclaimer: This project is purely an academic exploration rather than a product. Please be aware that due to the inherent limitations of large language models, there may be issues such as hallucinations.
OceanGPT(沧渊)专为海洋领域而设计,可以处理各种海洋科学任务,包括海洋相关的问答和内容生成。此外,我们试图验证 OceanGPT 在模拟水下具身智能方面的潜力。该模型仍然存在幻觉等局限性,我们将继续维护 OceanGPT,旨在增强其在海洋研究和探索中的实际应用能力。