<!DOCTYPE html>
<!-- saved from url=(0039) -->
<html lang="en"><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>Sab-AI Lab Japan-Machine Learning</title>
<meta name="description" content="Technologies Landing Page - Lab/Portfolio/Tech Landing Page">
<meta name="keywords" content="AI, machine learning, education, edtech, speech recognition, language identification, speech to text, Japan, technology, cognitive, neural network, e-learning, quantum programming, programming, medical, quantum computer, language">
<meta name="author" content="Themedept">
<link rel="shortcut icon" href="images/favicon.ico" type="image/x-icon">
<link rel="icon" href="images/favicon.ico" type="image/x-icon">
<link rel="stylesheet" href="css/bootstrap.min.css">
<link rel="stylesheet" href="css/style.css">
<link rel="stylesheet" href="css/responsive.css">
<!--[if lt IE 9]>
<script src="https://oss.maxcdn.com/html5shiv/3.7.2/html5shiv.min.js"></script>
<script src="https://oss.maxcdn.com/respond/1.4.2/respond.min.js"></script>
<![endif]-->
<!-- Global site tag (gtag.js) - Google Analytics -->
<script async src="https://www.googletagmanager.com/gtag/js?id=UA-133948123-1"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'UA-133948123-1');
</script>
</head>
<body >
<div class="loader bg-white" style="display: none;">
<div class="loader-inner ball-scale-ripple-multiple vh-center" style="display: none;">
<div></div>
<div></div>
<div></div>
</div>
</div>
<div class="main-container" id="page">
<section id="hero" class="bg-color content-align-md">
<div class="container">
<div class="row y-middle c2">
<img src="assets/images/logo5nn.png" alt="" />
<div class="col-lg-9 center-md text-white text-center">
<video width="520" height="420" autoplay loop muted>
<source src="images/mem.mp4" type="video/mp4">
Your browser does not support the video tag.
</video>
<h2 class="m-b-lg m-t intro-demo">Welcome to <span style="color:#26B897;"> Sab-AI Lab</span></h2>
<p class="lead m-b-md f-w-700">A boutique AI lab in Nagoya, Japan.</p>
<p class="lead m-b-md f-w-700"><span style="color:#26B897;">PDFs</span> are notoriously difficult to scrape. This program <span style="color:#26B897;">converts</span> them to <span style="color:#26B897;">*.txt or *.html</span> formats. The program has been tested on Latin-alphabet and Japanese text.</p>
<p class="lead m-b-md f-w-700">The description below lays out the technology's scope, accuracy, best use, and way forward.</p>
<h2>...</h2>
<h3>Datasets and models download: </h3>
<p></p>
<form action="https://github.com/Shahabks/Converter-pdf-files-to-.txt-or-.html/releases" onsubmit="if(document.getElementById('agree').checked) { return true; } else { alert('Please check the box to confirm that you have read and agree to the Terms & Conditions and Policy'); return false; }">
<input type="checkbox" name="checkbox" value="check" id="agree" /> By downloading this source code, I acknowledge that I have read and understood the system's scope and description below, as well as its behaviour and acceptance test criteria, in their entirety, and that I will
consider all requirements when I use or build upon the system so that it keeps performing as described.
<p></p>
<input type="submit" name="Agree" value="Download" style="color:black"/>
</form>
<h2>...</h2>
<p style="color:red;"> Note: This program cannot open encrypted PDFs. Decrypt your PDF file before using this program.
</p>
<h2>...</h2>
</div>
</div>
</div>
</section>
<!-- Home Section -->
<section id="home" class="home">
<div class="container">
<div class="row">
<div class="col-sm-12 ">
<div class="main_home text-center colorstext wow fadeInUp" data-wow-duration="700ms">
<h2 style="color:white;">...</h2>
<h2>Converter-pdf-files-to-.txt-or-.html</h2>
<p> </p>
<p style="font-size:120%;">I built this package on the work of Gorkovenko (Stanford University) and
Greenfield (Harvard University) to make pdfminer.six available for Python versions 3.x.
PDFs are notoriously difficult to scrape. Converting them to text files can make extracting their data
significantly easier. There are several tools that can help you do this, but I focus on the
one that I think is the best and easiest to use: pdfminer.six. To convert *.pdf files to *.txt or *.html, I
made a standalone executable version of the package, testpdf2txt.exe. You can download and use
it even if you do not have Python 3 installed on your machine.</p>
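<p style="font-size:120%;">For readers who prefer to call pdfminer.six directly from Python, the conversion can be sketched roughly as follows. This is a minimal illustration and not the code shipped in testpdf2txt.exe: it assumes pdfminer.six is installed, and the helper names <code>output_paths</code> and <code>convert_pdf</code> are hypothetical, chosen only for this example.</p>
<pre style="text-align:left;">
```python
from pathlib import Path


def output_paths(pdf_path, out_dir=None):
    """Return the (*.txt, *.html) output paths for a given PDF path.

    If out_dir is None, the outputs are placed next to the PDF.
    """
    pdf = Path(pdf_path)
    target = Path(out_dir) if out_dir is not None else pdf.parent
    return target / (pdf.stem + ".txt"), target / (pdf.stem + ".html")


def convert_pdf(pdf_path, out_dir=None):
    """Write text and HTML renderings of pdf_path and return both paths.

    Hypothetical helper for illustration; assumes an unencrypted PDF.
    """
    # Imported lazily so output_paths works without pdfminer.six installed.
    from pdfminer.high_level import extract_text_to_fp
    from pdfminer.layout import LAParams

    txt_path, html_path = output_paths(pdf_path, out_dir)
    txt_path.parent.mkdir(parents=True, exist_ok=True)
    for path, output_type in ((txt_path, "text"), (html_path, "html")):
        # Binary output plus codec="utf-8" lets pdfminer.six do the encoding.
        with open(pdf_path, "rb") as src, open(path, "wb") as dst:
            extract_text_to_fp(src, dst, output_type=output_type,
                               codec="utf-8", laparams=LAParams())
    return txt_path, html_path
```
</pre>
<p style="font-size:120%;">Under these assumptions, calling <code>convert_pdf("report.pdf", "converted")</code> would write converted/report.txt and converted/report.html.</p>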
<p style="font-size:120%;"> This is the result of <span style="color:#26B897;">improvement work</span> on a project called the Mysolution information extraction algorithm
for unstructured datasets, with an overall accuracy of <span style="color:#26B897;">99%</span>.</p>
</div>
</div><!-- End of col-sm-12 -->
</div><!-- End of row -->
</div><!-- End of Container -->
<div class="home_socail_bar">
<ul>
<li><a href="https://my-solutionailab.slack.com/messages/CFPMZ4GGY/"><i class="fa fa-slack"></i></a></li>
<li><a href="https://twitter.com/mysolEdTech?lang=en"><i class="fa fa-twitter"></i></a></li>
</ul>
</div>
</section><!-- End of Home Section -->
<!-- About Section -->
<section id="about" class="about colorsbg">
<div class="container-fluid">
<div class="row">
<div class="col-sm-6 col-sm-offset-1">
<div class="single_abt single_about_right_img">
<img src="assets/images/pdf2txt.jpg" alt="" />
</div>
</div>
<div class="col-sm-12 ">
<div class="main_about_area">
<div class="head_title text-center">
<h2>Please download testpdf2txt.exe using the Download button above.</h2>
</div>
<div class="about_content colorstext2 wow fadeInUp" data-wow-duration="700ms">
<div class="row">
<div class="col-sm-4 col-sm-offset-1">
<div class="single_abt single_about">
<p> You can save the program anywhere on your computer and run it by double-clicking it.</p>
<UL>
<LI style="font-size:100%;" >Put your PDF file in a folder.
<p></p>
<LI style="font-size:100%;" >Double-click the program and follow the instructions on the screen.
<p></p>
<LI style="font-size:100%;" >You may save the *.txt and *.html files in a different directory; enter the path to that directory if you wish.
<LI style="font-size:100%;" >Enter the filename of your PDF.
</UL>
<p> </p>
</div>
</div>
</div>
</div>
</div><!-- End of col-sm-12 -->
</div><!-- End of row -->
</div><!-- End of Container-fluid -->
<div class="service_more_btn padding-top-80 text-center">
<h2>...</h2>
<h2 align="center"> A quick performance report on ML </h2>
<table align="center" border="1">
<tr>
<th>Dataset</th>
<th>F1 score</th>
<th>Accuracy</th>
<th>Precision</th>
<th>Recall (Sensitivity)</th>
</tr>
<tr>
<td>For non-native</td>
<td>74%</td>
<td>72%</td>
<td>78%</td>
<td>76%</td>
</tr>
<tr>
<td>For Japanese-English speaker</td>
<td>78%</td>
<td>79%</td>
<td>81%</td>
<td>78%</td>
</tr>
</table>
<h2>...</h2>
<a href="https://github.com/Shahabks/Converter-pdf-files-to-.txt-or-.html/blob/master/LICENSE"><span style="color:#26B897;"> The PDF-to-TXT-HTML source code is licensed under MIT General Public License </span></a>
<h2>...</h2>
</div><!-- End of service_more_btn -->
</div><!-- End of container-fluid -->
</section><!-- End of About Section -->
<!-- Contact Section -->
<section id="contactus" class="contactus colorsbg">
<div class="container">
<div class="row">
<div class="main_service_area main_contactus_area">
<div class="head_title text-center wow fadeInUp">
<h2>Contact us</h2>
<h3> </h3>
<h3 class="text-uppercase">Office</h3>
<span>〒466-0834 Hirojichō, Umezono </span>
<span> Nagoya City Aichi.</span>
<span> Japan</span>
<p class="text-lowercase" style="color:#26B897;">[email protected]</p>
<p> </p>
<p> Sab-AI Lab 愛知県 名古屋市 昭和区 広路町字梅園 10-4</p>
</div>
</div>
</div><!-- End of row -->
</div><!-- End of container -->
</section><!-- End of Contactus Section -->
<!-- Footer Section -->
<footer id="footer" class="footer">
<div class="container">
<div class="row">
<div class="main_footer">
<div class="col-sm-2">
<div class="footer_logo">
</div>
</div>
<div class="col-sm-8">
<div class="copyright_text margin-top-20">
<p class=" wow fadeInRight animated" data-wow-duration="1s">Made <i class="fa fa-cog fa-spin"></i>
by <a href="https://github.com/Shahabks/Sab-AI-Lab">Sab-AI Lab</a> 2018. All Rights Reserved</p>
</div>
</div>
<div class="col-sm-2">
<div class="footer_socail text-right margin-top-20">
<ul class="list-inline">
<li><a href="https://my-solutionailab.slack.com/messages/CFPMZ4GGY/"><i class="fa fa-slack"></i></a></li>
<li><a href="https://twitter.com/mysolEdTech?lang=en"><i class="fa fa-twitter"></i></a></li>
</ul>
</div>
</div>
</div>
</div><!-- End of row -->
</div><!-- End of container-fluid -->
</footer><!-- End of Footer Section -->
<div class="scrollup">
<a href="#"><i class="fa fa-chevron-up"></i></a>
</div>
</div>
<script src="assets/js/vendor/jquery-1.11.2.min.js"></script>
<script src="assets/js/vendor/bootstrap.min.js"></script>
<!-- Compiled and minified JavaScript -->
<script src="https://cdnjs.cloudflare.com/ajax/libs/materialize/0.97.6/js/materialize.min.js"></script>
<script src="assets/js/jquery.easing.1.3.js"></script>
<script src="assets/js/jquery.magnific-popup.js"></script>
<script src="http://maps.google.com/maps/api/js"></script>
<script src="assets/js/gmaps.min.js"></script>
<script>
var map = new GMaps({
el: '.ourmap',
lat: -12.043333,
lng: -77.028333,
scrollwheel: false,
zoom: 15,
zoomControl: true,
panControl: false,
streetViewControl: true,
mapTypeControl: false,
overviewMapControl: false,
clickable: false,
styles: [{'stylers': [{'hue': '#000'}, {saturation: -200},
{gamma: 0.50}]}]
});
map.addMarker({
lat: -12.043333,
lng: -77.028333
});
</script>
<script src="assets/js/materialize.js"></script>
<script src="assets/js/plugins.js"></script>
<script src="assets/js/main.js"></script>
</body></html>