数据量非常之大,以下是我用notepad2打开数据文件的截取内容:
ID 3HAO_DICDI Reviewed; 182 AA.
AC Q54S02;
DT 08-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 24-MAY-2005, sequence version 1.
DE 3-hydroxyanthranilate 3,4-dioxygenase (EC 1.13.11.6) (3-
DE hydroxyanthranilic acid dioxygenase) (3-hydroxyanthranilate oxygenase)
GN Name=haao; ORFNames=DDB_0231359;
OS Dictyostelium discoideum (Slime mold).
//
ID ZPR1_PAPAN Reviewed; 459 AA.
AC A9CB27;
DT 08-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 15-JAN-2008, sequence version 1.
DT 08-APR-2008, entry version 3.
DE Zinc finger protein ZPR1 (Zinc finger protein 259).
GN Name=ZNF259; Synonyms=ZPR1;
OS Papio anubis (Olive baboon).
//
可以看到列名应该是ID,AC,DT,DE,GN,OS
这种数据文件跟我之前遇到的那种普通的类型不太一致,请问我如何才能将之导入到oracle中去呢?
ID 3HAO_DICDI Reviewed; 182 AA.
AC Q54S02;
DT 08-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 24-MAY-2005, sequence version 1.
DE 3-hydroxyanthranilate 3,4-dioxygenase (EC 1.13.11.6) (3-
DE hydroxyanthranilic acid dioxygenase) (3-hydroxyanthranilate oxygenase)
GN Name=haao; ORFNames=DDB_0231359;
OS Dictyostelium discoideum (Slime mold).
//
ID ZPR1_PAPAN Reviewed; 459 AA.
AC A9CB27;
DT 08-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 15-JAN-2008, sequence version 1.
DT 08-APR-2008, entry version 3.
DE Zinc finger protein ZPR1 (Zinc finger protein 259).
GN Name=ZNF259; Synonyms=ZPR1;
OS Papio anubis (Olive baboon).
//
可以看到列名应该是ID,AC,DT,DE,GN,OS
这种数据文件跟我之前遇到的那种普通的类型不太一致,请问我如何才能将之导入到oracle中去呢?
AC Q54S02;
DT 08-APR-2008, integrated into UniProtKB/Swiss-Prot.
DT 24-MAY-2005, sequence version 1.
DE 3-hydroxyanthranilate 3,4-dioxygenase (EC 1.13.11.6) (3-
DE hydroxyanthranilic acid dioxygenase) (3-hydroxyanthranilate oxygenase)
GN Name=haao; ORFNames=DDB_0231359;
OS Dictyostelium discoideum (Slime mold). 当成一个字符串处理 ;导入之后,按列名 ID,AC,DT,DE,GN,OS 或者能够区分的格式,写存储过程, 重新解析即可