cleaned titles

This commit is contained in:
Sanj 2012-01-31 06:01:33 +05:30
parent 962549f0f7
commit c9f0ae7cf3
3 changed files with 14571 additions and 14557 deletions

File diff suppressed because it is too large Load Diff

View File

@ -2,6 +2,7 @@ from jinja2 import Template
from os.path import join
import json
import codecs
import re
def do():
data = json.load(open("chronoPeople.json"))
@ -26,6 +27,19 @@ def do():
out.write(s)
out.close()
def clean_titles():
data = json.load(open("chronoPeople.json"))
for d in data:
title = d['title']
d['title'] = cleanTitle(title)
out = open("chronoPeople.json", "w")
out.write(json.dumps(data, indent=2))
out.close()
def cleanTitle(string):
regex = re.compile(r'\(.*?\)')
return re.sub(regex, "", string).strip()
def peoplejson():
f = open("people.txt")
o = ''
@ -39,7 +53,7 @@ def peoplejson():
# out.write(json.dumps(json.loads(d), indent=2))
# out.close()
'''
def doPersons():
radias = json.load(open("chronoPrint.json"))
people = json.load(open("people.json"))
@ -52,7 +66,7 @@ def doPersons():
o = open("chronoRadiaPeople.json", "w")
o.write(json.dumps(radias, indent=2))
o.close()
'''
def radiaToChrono():
radias = json.load(open("radiaPeople.json"))
@ -66,7 +80,7 @@ def radiaToChrono():
out.write(json.dumps(chronos))
out.close()
'''
def addIndexes():
people = json.load(open("chronoRadiaPeople.json"))
x = 0
@ -87,4 +101,4 @@ def addIndexes():
out2 = open("peopleToBeFixed.json", "w")
out2.write(json.dumps(short, indent=2))
out2.close()
'''

View File

@ -7684,7 +7684,7 @@ NKS: And your people are doing a fabulous job on the, on the (inaudible) and pub
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">42. </span>Radia Tapes: Venkat (Executive Secretary to Ratan Tata), Radia</h3>
<h3 class="title"><span class="counter">42. </span>Radia Tapes: Venkat , Radia</h3>
<div class="people"><span class="personName">Venkat</span>: Executive Secretary to R. Tata</div>
@ -14011,7 +14011,7 @@ Niira Radia: OK then.
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">65. </span>Radia Tapes: Radia, Rashmi (ET)</h3>
<h3 class="title"><span class="counter">65. </span>Radia Tapes: Radia, Rashmi</h3>
<div class="people"><span class="personName">Rashmi Pratap</span>: Journalist, Economic Times</div>
@ -16648,7 +16648,7 @@ Ratnam: I have ma'am. Give me a missed call from that number <i>na</i> -</pre>
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">73. </span>Radia Tapes: Radia, G.Ganapathy Subramanium (ET)</h3>
<h3 class="title"><span class="counter">73. </span>Radia Tapes: Radia, G.Ganapathy Subramanium</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -19917,7 +19917,7 @@ Niira: Ok. Bye.
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">85. </span>Radia Tapes: Radia, G.Ganapathy Subramanium (ET)</h3>
<h3 class="title"><span class="counter">85. </span>Radia Tapes: Radia, G.Ganapathy Subramanium</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -20037,7 +20037,7 @@ Niira Radia: No because there's a -
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">86. </span>Radia Tapes: Radia, Prasad (Tata Adviser)</h3>
<h3 class="title"><span class="counter">86. </span>Radia Tapes: Radia, Prasad</h3>
<div class="people"><span class="personName">PMS Prasad</span>: Executive Director, Reliance Industries Ltd.</div>
@ -20836,7 +20836,7 @@ Niira: See, I did tell them both on Friday and they were supposed to... I'll jus
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">88. </span>Radia Tapes: Radia, R.Sridharan (ET Now)</h3>
<h3 class="title"><span class="counter">88. </span>Radia Tapes: Radia, R.Sridharan</h3>
<div class="people"><span class="personName">Sridharan Ramakrishnan/R. Sridharan</span>: Senior Editor (News & Trends), ET NOW</div>
@ -21205,7 +21205,7 @@ Niira Radia: I will do that -
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">89. </span>Radia Tapes: Radia, Manish (Bangalore Office)</h3>
<h3 class="title"><span class="counter">89. </span>Radia Tapes: Radia, Manish</h3>
<div class="people"><span class="personName">Manish</span>: Employee, Bangalore office, Vaishnavi Corporate Communications, (Radia's company)</div>
@ -21916,7 +21916,7 @@ S: Hmm, hmm, hmm.</pre>
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">92. </span>Radia Tapes: Radia, Shalini Singh (Tata Group)</h3>
<h3 class="title"><span class="counter">92. </span>Radia Tapes: Radia, Shalini Singh</h3>
<div class="people"><span class="personName">Shalini Singh</span>: Head, Corporate Communications, TATA Power</div>
@ -22166,7 +22166,7 @@ SH: I'll let you know. I'll keep you posted. I am just waiting for those drafts,
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">93. </span>Radia Tapes: Radia, Rohit Khanna (Vaishnavi)</h3>
<h3 class="title"><span class="counter">93. </span>Radia Tapes: Radia, Rohit Khanna</h3>
<div class="people"><span class="personName">Rohit Khanna</span>: Associate Director, Vaishnavi Corporate Communications, (Radia's company)</div>
@ -23688,7 +23688,7 @@ NR: Okay, okay. Bye.</pre>
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">99. </span>Radia Tapes: Radia, G.Ganapathy Subramanium (ET)</h3>
<h3 class="title"><span class="counter">99. </span>Radia Tapes: Radia, G.Ganapathy Subramanium</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -23857,7 +23857,7 @@ G. Ganapathy Subramaniam: Yeah (unclear)</pre>
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">100. </span>Radia Tapes: Radia, G.Ganapathy Subramanium (ET)</h3>
<h3 class="title"><span class="counter">100. </span>Radia Tapes: Radia, G.Ganapathy Subramanium</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -24469,7 +24469,7 @@ Radia: Okay fine. Okay. So I'll just say that it was a last board meeting as a l
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">104. </span>Radia Tapes: Radia with unknown (Mohan?)</h3>
<h3 class="title"><span class="counter">104. </span>Radia Tapes: Radia with unknown</h3>
<div class="datetime">
<div class="date">Date: Friday 19, June 2009</div>
@ -29933,7 +29933,7 @@ Caller: 10 July, ok, <i>bata sakti hain, mujhe ki</i> London <i>karna hai</i>, N
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">127. </span>Radia Tapes: Radia, G.Ganapathy Subramanium (ET)</h3>
<h3 class="title"><span class="counter">127. </span>Radia Tapes: Radia, G.Ganapathy Subramanium</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -30439,7 +30439,7 @@ Radia: ok, thanks.
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">129. </span>Radia Tapes: Radia, G.Ganapathy Subramanium (ET)</h3>
<h3 class="title"><span class="counter">129. </span>Radia Tapes: Radia, G.Ganapathy Subramanium</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -31069,7 +31069,7 @@ G. Ganapathy Subramaniam: Thanks -
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">130. </span>Radia Tapes: Radia, G.Ganapathy Subramanium (ET)</h3>
<h3 class="title"><span class="counter">130. </span>Radia Tapes: Radia, G.Ganapathy Subramanium</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -32570,7 +32570,7 @@ Tarun Das: Yeah...
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">137. </span>Radia Tapes: Radia, G Ganapathy Subramaniam (Pressurising ET)</h3>
<h3 class="title"><span class="counter">137. </span>Radia Tapes: Radia, G Ganapathy Subramaniam</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -36863,7 +36863,7 @@ Padmanabhan: Okay, okay.</pre>
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">157. </span>Radia Tapes: Radia, Rakesh Hari Pathak (Economic Bureau Chief PTI)</h3>
<h3 class="title"><span class="counter">157. </span>Radia Tapes: Radia, Rakesh Hari Pathak</h3>
<div class="people"><span class="personName">Rakesh Hari Pathak</span>: Economic Bureau Chief, PTI</div>
@ -37433,7 +37433,7 @@ Niira Radia: OK, see you, bye.
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">159. </span>Radia Tapes: Radia, Jaideep Bose (TOI)</h3>
<h3 class="title"><span class="counter">159. </span>Radia Tapes: Radia, Jaideep Bose</h3>
<div class="people"><span class="personName">Jaideep Bose</span>: Editor-in-Chief, Times of India</div>
@ -37846,7 +37846,7 @@ Jaideep Bose: Its comes home to roost other people, not necessarily for the pers
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">160. </span>Radia Tapes: Radia, G Ganapathy Subramaniam (ET)</h3>
<h3 class="title"><span class="counter">160. </span>Radia Tapes: Radia, G Ganapathy Subramaniam</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>
@ -44284,7 +44284,7 @@ Manoj: Okay bye.</pre>
<div class="fileWrapper">
<div class="headerWrapper">
<h3 class="title"><span class="counter">181. </span>Radia Tapes: Radia, G.Ganapathy Subramanium (ET)</h3>
<h3 class="title"><span class="counter">181. </span>Radia Tapes: Radia, G.Ganapathy Subramanium</h3>
<div class="people"><span class="personName">G. Ganapathy Subramaniam</span>: National Policy Editor, ET NOW and Assistant Editor, Economic Times</div>