我正在学习python中的文件概念 . 在这里,我想从文本文件中提取一些文本(在 bold format 中) . 但我不知道该怎么做 .
这是我的示例文本文件:
{"glossary":{"title":"example glossary","GlossDiv":{ "title": "S" ,"GlossList":{"GlossEntry":{"ID":"SGML","SortAs":"SGML","GlossTerm":"Standard Generalized Markup Language","Acronym":"SGML","Abbrev":"ISO 8879:1986","GlossDef":{ "para": "A meta-markup language, used to create markup languages such as DocBook." ,"GlossSeeAlso":["GML","XML" ]},"GlossSee":"markup"}}}}}
{"glossary":{"title":"example glossary","GlossDiv":{ "title": "a" ,"GlossList":{"GlossEntry":{"ID":"SGML","SortAs":"SGML","GlossTerm":"Standard Generalized Markup Language","Acronym":"SGML","Abbrev":"ISO 8879:1986","GlossDef":{ "para": "A meta-markup language, used to create markup languages such as DocBook." ,"GlossSeeAlso":["GML","XML" ]},"GlossSee":"markup"}}}}}
{"glossary":{"title":"example glossary","GlossDiv":{ "title": "b" ,"GlossList":{"GlossEntry":{"ID":"SGML","SortAs":"SGML","GlossTerm":"Standard Generalized Markup Language","Acronym":"SGML","Abbrev":"ISO 8879:1986","GlossDef":{ "para": "A meta-markup language, used to create markup languages such as DocBook." ,"GlossSeeAlso":["GML","XML" ]},"GlossSee":"markup"}}}}}
{"glossary":{"title":"example glossary","GlossDiv":{ "title": "c" ,"GlossList":{"GlossEntry":{"ID":"SGML","SortAs":"SGML","GlossTerm":"Standard Generalized Markup Language","Acronym":"SGML","Abbrev":"ISO 8879:1986","GlossDef":{ "para": "A meta-markup language, used to create markup languages such as DocBook." ,"GlossSeeAlso":["GML","XML" ]},"GlossSee":"markup"}}}}}
{"glossary":{"title":"example glossary","GlossDiv":{ "title": "f" ,"GlossList":{"GlossEntry":{"ID":"SGML","SortAs":"SGML","GlossTerm":"Standard Generalized Markup Language","Acronym":"SGML","Abbrev":"ISO 8879:1986","GlossDef":{ "para": "A meta-markup language, used to create markup languages such as DocBook." ,"GlossSeeAlso":["GML","XML" ]},"GlossSee":"markup"}}}}
我需要像这样的输出:
"title": "S" ; "para": "A meta-markup language, used to create markup languages such as DocBook."
"title": "a" ; "para": "A meta-markup language, used to create markup languages such as DocBook."
....
任何人都可以帮我在python中做到这一点?
1 回答
由于没有尝试显示任何代码,我只会给出一些一般性的建议 . 文件中的每一行实际上都是一个称为
dictionary
的Python数据结构 . 您可以读取每一行并使用Python的json
库来解析它并将其存储为Dictionary(dict
) . 有关Python词典的更多信息,请访问here:从文件中读取多个JSON对象的一些想法可以在StackOverflow question的接受答案中找到 . 特别是最后一个示例代码段 .
读入字典后,您可以对数据执行字典查找 . 如果我将字典读入名为
dictvar
的变量中,那么我可以通过执行以下操作来打印glossary
title
:如果我正在阅读上面文件中的第一个例子,那将打印:
实际上,您可以使用最少量的代码生成一个可以执行所需操作的程序